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NOVEL GLUTAMATE RECEPTOR MODULATORY PROTEINS AND 
NUCLEIC ACID MOLECULES AND USES THEREFOR 

Related Applications 

This application claims the benefit of U.S. Provisional Application Serial No. 
60/257,589, filed December 22, 2000, the entire contents of which are incorporated 
herein by this reference. 

Background of the Invention 

Glutamate is the transmitter of the vast majority of the exicitatory synapses in the 
mammalian central nervous system (CNS) and plays an important role in a wide variety 
of CNS fmctions (for review, see Hollmann and Heinemann (1994) Annu. Rev, 
Neuroscl 17:3 1-108). The understanding of the glutamatergic synapse has advanced 
enormously in the last 10 years, primarily through application of molecular biological 
techniques to the study of glutamate receptors and transporters. There are three families 
of glutamate-gated cation channels termed ionotropic glutamate receptors (iGluRs); the 
N-methyl-D-aspartate (NMD A) receptor, the alpha-amino-3-hydroxy-5-methyl-4- 
isoxazolepropionic acid (AMP A) receptor and the kainate receptor. There are also three 
groups (based of their amino acid identity) of metabotropic, G protein-coupled 
glutamate receptors (mGluRs) that modify neuronal and glial excitability through G 
protein subunits acting on membrane ion channels and second messengers such as 
diacylglycerol and cAMP. In addition, there are at least two glial glutamate transporters 
and three neuronal transporters in the brain. 

The glutamate receptors are known to be important in modulating a variety of 
neuronal activities including, but not limited to, neural plasticity, neural development 
and neurodegeneration. The NMDA receptors and AMPA/kainate receptors act as 
glutamate-gated cation channels, whereas metabotropic receptors (mGluRs) modulate 
the production of second messengers via G proteins. Molecular studies indicate that 
NMDA receptors exist as multiple subunits (NMDAR1 and NMDAR2A-2D) and that 
and mGluRs exist as multiple subtypes (mGluRl-mGluR8). Moreover, splice variants 
have been found for at least three mGluRs: mGluRl, mGluR4 and mGluR5. 

The subtypes mGluRl and mGluR5 are coupled to the stimulation of the 
phosphatidylinositol-calcium second messenger system while mGluR2, mGluR3, 
mGluR4, mGluR6, mGluR7 and mGluR8 are coupled to G rpoteins that inhibit 
adenylate cyclase activity. The metabotropic glutamate receptors (mGluRs), for 
example mGluRl and mGluR5, can increase intracellular Ca2+ concentration via 
Ins(l,4,5)P3- and ryanodine- sensitive Ca2+ stores in neurons. Both types of store are 
coupled functionally to Ca2+-permeable channels found in the plasma membrane. The 
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mGluR-mediated increase in intracellular Ca2+ concentration can activate Ca2+- 
sensitive K+ channels and Ca2+-dependent nonselective cationic channels. These 
mGluR-mediated effects often result from mobilization of Ca2+ from ryanodine- 
sensitive, rather than Ins(l,4, 5)P3 -sensitive, Ca2+ stores, suggesting that close 
5 functional interactions exist between mGluRs, intracellular Ca2+ stores and Ca2+- 
sensitive ion channels in the membrane. 

With respect to the metabotropic glutamate receptor subtype 5 (mGluR5) two 
isoforms have been identified in the rat and human brain. Compared with mGluSa, 
mGlu5b contains an insertion of 32 amino acids, 50 residues after the seventh 
10 transmembrane domain. The primary sequence of the human receptors share about 93- 
96% idendity compared with the rat homologues. The deduced amino acid sequence of 
the large extracellular domain is extremely well conserved between rat and human 
C3 mGluR5 (98.6%) which suggests that the amino-terminal region of mGluR5 is 

functionally important. Expression of either human isoform in Xenopus oocytes evokes 
1 5 glutamate responses, which suggests that the two receptors from the human brain could 
activate phospholipase C and generate Ca(2+)-activated CI- current. 

Because of the ubiquitous distribution of glutamatergic synapses, mGluRs have 
h b the potential to participate in a wide variety of functions in the CNS. In addition, the 

wide diversity and heterogeneous distribition of mGluR subtypes provide an opportunity 
Up 20 for developing pharmacologic agents that selectively interact with mGluRs involved in 
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only one or a limited number of CNS functions. Accordingly, there exists a need to gain 
a detailed understanding of the specific roles of mGluRs in CNS function and to identify 
modulators of particular mGluRs, in particular, for use in developing novel strategies for 
a variety of psychiatric and neurological disorders. 

25 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of nucleic acid 
molecules which encode a novel family of secreted proteins having homology to the 
metabotropic glutamate receptor subtype 5 subfamily, in particular, to the N-terminal 
30 portion of the metabotropic glutamate receptors mGluR5a and mGluR5b, referred to 
herein as the metabotropic glutamate receptor subtype 5 modulatory proteins (also 
referred to herein as the "mGluR5M" proteins or "mGluR5M" family). The mGluRSM 
molecules of the present invention as well as mGluR5M mimics and/or mGluR5M 
modulators, are useful in regulating a variety of cellular processes. Accordingly, in one 
35 aspect, this invention provides isolated nucleic acid molecules encoding mGluRSM 

proteins or biologically active portions thereof, as well as nucleic acid fragments suitable 
as primers or hybridization probes for the detection of mGluR5M-encoding nucleic 
acids. 
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In one embodiment, a mGluRSM nucleic acid molecule is 60% identical to the 
nucleotide sequence shown in SEQ ID NO:l, the nucleotide sequence of the DNA insert 
of the plasmid deposited with ATCC as Accession Number PTA-2775, or complement 
thereof. In a preferred embodiment, an isolated mGluRSM nucleic acid molecule has 
5 the nucleotide sequence shown SEQ ID NO:3, or a complement thereof. In another 
preferred embodiment, an isolated mGluR5M nucleic acid molecule has the nucleotide 
sequence shown in SEQ ID NO:l, or a complement thereof In yet another preferred 
embodiment, an isolated nucleic acid molecule has the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number PTA-2775, or a 
1 0 complement thereof. 

In another embodiment, a mGluR5M nucleic acid molecule includes a nucleotide 
sequence encoding a protein having an amino acid sequence sufficiently homologous or 
identical to the amino acid sequence of SEQ ID NO:2 or an amino acid or an amino acid 
sequence encoded by the DNA insert of the plasmid deposited with the ATCC as 
1 5 Accession No. PTA-2775. In another preferred embodiment, a mGluRSM nucleic acid 
molecule includes a nucleotide sequence encoding a protein having an amino acid 
sequence at least 60% identical to the amino acid sequence of SEQ ID NO:2 or an amino 
acid or an amino acid sequence encoded by the DNA insert of the plasmid deposited 
with the ATCC as Accession No. PTA-2775. 
yy 20 In another embodiment, an isolated nucleic acid molecule of the present 

invention encodes a mGluR5M protein which includes an N-terminal mGluR-like 
domain and/or a C-terminal unique domain. In another embodiment, an isolated nucleic 
acid molecule of the present invention encodes a protein which includes a G protein 
coupled receptor family 3 consensus sequence. In yet another embodiment, a mGluRSM 
25 nucleic acid molecule encodes a mGluR5M protein which lacks a transmembrane 
domain. In yet another embodiment, a mGluR5M nucleic acid molecule encodes a 
mGluRSM protein and is a naturally occurring nucleotide sequence. In yet another 
embodiment, a mGluRSM nucleic acid molecule encodes a mGluRSM protein which 
lacks a transmembrane domain. 
30 Another embodiment of the invention features mGluR5M nucleic acid molecules 

which specifically detect mGluRSM nucleic acid molecules relative to nucleic acid 
molecules encoding non-mGluRSM proteins. For example, in one embodiment, a 
mGluRSM nucleic acid molecule is at least 30 nucleotides in length and hybridizes 
under stringent conditions to a complement of a nucleic acid molecule comprising the 
35 nucleotide sequence shown in SEQ ID NO:l or the nucleotide sequence of the DNA 

insert of the plasmid deposited with ATCC as Accession Number PTA-2775. In another 
embodiment, a mGluRSM nucleic acid molecule hybridizes under stringent conditions 
to a complement of a nucleic acid molecule consisting of about nucleotides 880-1823 of 
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SEQ ID NO:l . Another embodiment of the invention provides an isolated nucleic acid 
molecule which is antisense to the coding strand of a mGluRSM nucleic acid. 

Another aspect of the invention provides a vector comprising a mGluRSM 
nucleic acid molecule. In certain embodiments, the vector is a recombinant expression 
5 vector. In another embodiment, the invention provides a host cell containing a vector of 
the invention. The invention also provides a method for producing a mGluRSM protein 
by culturing in a suitable medium, a host cell of the invention containing a recombinant 
expression vector such that a mGluRSM protein is produced. 

Another aspect of this invention features isolated or recombinant mGluRSM 

1 0 proteins and polypeptides. In one embodiment, an isolated mGluRSM protein includes 
an N-terminal mGluR-like domain. In another embodiment, an isolated mGluR5M 
protein includes a C-terminal unique domain. In another embodiment, an isolated 
mGluRSM protein lacks a transmembrane domain. In another embodiment, an isolated 
mGluRSM protein has an amino acid sequence sufficiently homologous or identical to 

15 the amino acid sequence of SEQ ID NO:2. In a preferred embodiment, a mGluRSM 
protein has an amino acid sequence at least about 60% identical to the amino acid 
sequence of SEQ ID NO:2. In another embodiment, a mGluRSM protein has the amino 
acid sequence of SEQ ID NO:2. 

Another embodiment of the invention features an isolated mGluRSM protein 

20 which is encoded by a nucleic acid molecule having a nucleotide sequence at least about 
60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
99% or more identical to a nucleotide sequence of SEQ ID NO: 1 , or a complement 
thereof. Yet another embodiment of the invention features an isolated mGluRSM 
protein which is at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 

25 94%, 95%, 96%, 97%, 98%, 99% or more identical to a polypeptide having the amino 
acid sequence of SEQ ID NO:2. This invention further features an isolated mGluRSM 
protein which is encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a complement of a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO:l. 

30 The mGluR5M proteins of the present invention, or biologically active portions 

thereof, can be operatively linked to a non-mGluRSM polypeptide to form mGluRSM 
fusion proteins. The invention further features antibodies that specifically bind 
mGluRSM proteins, such as monoclonal or polyclonal antibodies. In addition, the 
mGluRSM proteins or biologically active portions thereof can be incorporated into 

35 pharmaceutical compositions, which optionally include pharmaceutically acceptable 
carriers. 

In another aspect, the present invention provides a method for detecting 
mGluRSM expression in a biological sample by contacting the biological sample with 
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an agent capable of detecting a mGluR5M nucleic acid molecule, protein or polypeptide 
such that the presence of a mGluRSM nucleic acid molecule, protein or polypeptide is 
detected in the biological sample. 

In another aspect, the present invention provides a method for detecting the 
5 presence of mGluRSM activity in a biological sample by contacting the biological 
sample with an agent capable of detecting an indicator of mGluRSM activity such that 
the presence of mGluR5M activity is detected in the biological sample. 

In another aspect, the invention provides a method for modulating mGluR5 
activity comprising contacting the cell with an mGluRSM protein, peptide, nucleic acid 

10 molecule, peptidomimetic or other small molecule such that mGluRS activity in the cell 
is modulated. In one embodiment, the mGluR5M protein, peptide, antibody, nucleic 
acid molecule, peptidomimetic or other small molecule inhibits mGluR5 activity. In 
another embodiment, the mGluRSM protein, peptide, antibody, nucleic acid molecule, 
peptidomimetic or other small molecule stimulates mGluR5 activity. In a preferred 

1 5 embodiment, the agent is an antibody that specifically binds to a mGluR5M protein. In 
another embodiment, the agent is a mGluRSM nucleic acid molecule. In yet another 
embodiment, the agent is a mGluRSM protein, peptide of peptidomimetic. 

In one embodiment, the methods of the present invention are used to treat a 
subject having a disorder characterized by aberrant mGluR expression or activity by 

20 administering a mGluRSM protein, peptide, antibody, nucleic acid molecule, 

peptidomimetic or other small molecule to the subject. In a preferred embodiment, the 
disorder characterized by aberrant mGluRSM protein or nucleic acid expression is 
central nervous system disorder. 

The present invention also provides a diagnostic assay for identifying the 

25 presence or absence of a genetic alteration characterized by at least one of (i) aberrant 
modification or mutation of a gene encoding a mGluR5M protein; (ii) mis-regulation of 
said gene; and (iii) aberrant post-translational modification of a mGluR5M protein, 
wherein a wild-type form of said gene encodes an protein with a mGluRSM activity. 

In another aspect the invention provides a method for identifying a compound 

30 that binds to or modulates the activity of a mGluR5M protein. In one embodiment, the 
invention provides a method for identifying a compound which binds to a mGluRSM 
protein which involves contacting the mGluRSM protein with a test compound 
(optionally in addition to mGluR5 ligand) and determining whether the mGluRSM 
protein binds to the test compound. In another embodiment, the invention provides a 

35 method for identifying a compound which binds to or modulates the activity of a 

mGluRSM protein which involves contacting a cell expressing a mGluRS protein with a 
mGluRSM protein and a test compound, and determining the effect of the test compound 
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on the binding or activity of the mGluRSM protein to thereby identify a compound 
which modulates the activity of the mGluRSM protein. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

5 

Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence and predicted amino acid sequence of 
human mGluRSJVL The nucleotide sequence (panel A) corresponds to nucleic acids 1 to 
1823 of SEQ ID NO:l. The coding region is underlined. The amino acid sequence 
10 (panel B) corresponds to amino acids 1 to 369 of SEQ ID NO:2. 

Figure 2 depicts a multiple sequence alignment (MSA) of the amino acid 
q sequences of human mGluRSM (SEQ ID NO:2), the first 370 amino acid residues of 

human mGluRS (amino acid residues 1 to 370 of SEQ ID NO:4), and the first 369 amino 
acid residues of rat mGluRS (amino acid residues 1 to 369 of SEQ ID NO:5). The 
1 5 alignment was performed using the Clustal algorithm which is part of the MEGALIGN 
I *J program (e.g. , version 3.1.7) which is part of the DNASTAR sequence analysis software 

package. The pairwise alignment parameters are as follows: K-tuple = 1; Gap Penalty = 
3; Window = 5; Diagonals saved = 5 . The multiple alignment parameters are as 
follows: Gap Penalty =10; and Gap length penalty =10. The asterisks indicate 
^ 20 important conserved residues in proteins aligned. 
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Detailed Description of the Invention 

The present invention is based on the discovery of novel molecules, referred to 
herein as metabotropic glutamate receptor subtype 5 modulatory ("mGluRSM") protein 

25 and nucleic acid molecules, which comprise a family of molecules having certain 
conserved structural and functional features. The term "family" when referring to the 
protein and nucleic acid molecules of the invention is intended to mean two or more 
proteins or nucleic acid molecules having a common structural domain and having 
sufficient amino acid or nucleotide sequence homology or identity as defined herein. 

30 Such family members can be naturally occurring and can be from either the same or 
different species. For example, a family can contain a first protein of human origin, as 
well as other, distinct proteins of human origin or alternatively, can contain homologues 
of non-human origin. Members of a family may also have common functional 
characteristics. 

35 For example, the mGluR5M proteins of the present invention bear significant 

homology to the N-terminal extracellular domain of the metabotropic glutamate 
receptors, mGluR5a and mGluR5b. The metabotropic glutamte receptors are known to 
share certain conserved amino acid residues, some of which have been determined to be 
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important in ligand binding and/or receptor function. For example, the metabotropic 
glutamate receptors are known to share at least 19 conserved cysteine residues within the 
N-terminal extracellular domain and extracellular loops which are proposed to be 
important in the three dimensional structure of the receptors and/or in intramolecular 
5 transduction. At least three of these conserved cysteine residues are present in the 

human mGluRSM amino acid sequence set forth as SEQ ID NO:2. Moreover, based on 
homology of the N-terminal extracellular domains of the metabotropic glutamate 
receptors to known bacterial periplasmic binding proteins (PBLs), a serine residue 
(located at amino acid 152 in the human mGluRS sequence set forth as SEQ ID NO:4) 
10 and a threonine residue (located at at amino acid 175 in the human mGluRS sequence set 
forth as SEQ ID NO:4) are predicted to be important in glutamate binding, both of 
which are present in the human mGluR5M amino acid sequence set forth as SEQ ID 
NO:2 (i.e., Serl52 and Thrl75 of SEQ ID NO:2). These important conserved residues 
are indicated by asterisks in Figure 2. 
1 5 In one aspect, the mGluRSM proteins of the present invention are proteins 

having an amino acid sequence of about 330-410, preferably about 340-400, more 
preferably about 350-390, more preferably about 360-380, or about 369-370 amino acids 
in length, and include at least one domain or consensus sequence characteristic of the 
jry family of mGluRSM family, as described herein. In another embodiment, the 

f*f 20 mGluR5M proteins of the present invention contain a signal sequence. As used herein, a 

g "signal sequence" includes a peptide of about 20 amino acid residues in length which 

M occurs at the N-terminus of secretory and integral membrane proteins and which 

contains at least 30% hydrophobic amino acid residues. In a preferred embodiment, a 
signal sequence contains at least about 15, 20, 25, 30, 35, 40 or more amino acid 
25 residues and has at least about 30%, 35%, 40%, 45%, 50%, 55%, 60% or more 
hydrophobic amino acid residues (e.g., Alanine, Valine, Leucine, Isoleucine, 
Phenylalanine, Tyrosine, Tryptophan, or Proline). Such a "signal sequence", also 
referred to in the art as a "signal peptide", serves to direct a protein containing such a 
sequence to a lipid bilayer. For example, a signal sequence can be found about amino 
30 acids 1-20 of SEQ ID NO:2 (Metl to Ala20 of the mGluR5M amino acid sequence set 
forth as SEQ ID NO:2). In yet another embodiment, a mGluRSM protein is a mature 
mGluRS M protein. As used herein, the term "mature mGluRSM protein" refers to a 
mGluRS M protein from which the signal peptide has been cleaved. In an exemplary 
embodiment, a mature mGluRSM protein contains amino acid residues 21-369 of SEQ 
35 IDNO:2. 

In another embodiment, a mGluRSM protein of the present invention includes an 
N-terminal mGluR-like domain. As used herein, the term "N-terminal mGluR-like 
domain" refers to a protein domain having an amino acid sequence of about 250-350 



-7- 



ATTORNEY DOCKET NO.: GNN-024 



5 i 



amino acids which shares at least about 80% identity with a mGluR amino acid 
sequence. Preferably, an M N-terminal mGluR-like domain" includes a protein domain 
having an amino acid sequence of about 260-340, 270-330, 280-320, 290-310 or about 
300 amino acid residues (e.g., about 302 amino acid residues) and shares at least about 
5 85% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identity with a 
mGluR amino acid sequence. In an exemplary embodiment, a mGluR5M protein 
contains anN-terminal mGluR-like domain including about amino acids 1-303 of SEQ 
IDNO:2. 

In yet another embodiment, a mGluR5M protein of the present invention 
10 includes a C-terminal unique domain. The term "C-terminal unique domain" as used 
herein, refers to a protein domain of a mGluR5M protein family member which includes 
amino acid residues C-terminal to the N-terminal mGluR-like domain in the amino acid 

Q. sequence of the mGluR5M protein. As used herein, a "C-terminal unique domain" 

ff i 

H refers to a protein domain which is at about 50-75 amino acid residues in length, 

*J| 1 5 preferably at least about 55-70 amino acid residues in length, and more preferably at 

least about 60-65 amino acid residues in length (e.g., about 63 amino acid residues in 
length), and has at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98%, 99% or more identity with the C-terminal amino acid 
sequence of another mGluRSM family member. As further defined herein, a C-terminal 
20 unique domain of a mGluRSM protein family member, however, is not sufficiently 

homologous to the amino acid sequence of a mGluR. For example, a C-terminal unique 
domain of mGluR5M homologue has at least about 80% homology to the C-terminal 
unique domain of human mGluR5M from about amino acids 304-369 of SEQ ID NO:2 
but has no significant homology to the amino acid sequence of human mGluR5. 

25 A mGluR5M family member (or an N-terminal mGluR-like domain) can 

further be identified based on the presence of at least one G-protein coupled 
receptor family 3 signature consensus sequence. For example, a mGluRSM 
protein can include a G-protein coupled receptor family 3_1 consensus having the 
sequence [LV]-X-N-[LIVM](2)-X-L-F-X-I-[PA]-Q-[LIVM]-[STA]-X-[STA](3)- 
30 [STAN] (SEQ ID NO:6). The consensus sequence described herein is described 
according to standard Prosite Signature designation (e.g., all amino acids are 
indicated according to their universal single letter designation; X designates any 
amino acid; X(n) designates any n amino acids, e.g. , X (2) designates any 2 amino 
acids; and [LIVM] indicates any one of the amino acids appearing within the 
35 brackets, e.g., any one of L, I, V, or M, in the alternative, any one of Leu, lie, Val, 
or Met.) 
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In yet another embodiment, a mGluRSM protein of the present invention lacks a 
transmembrane domain. As used herein, a "transmembrane domain" includes a protein 
domain having at least about 10 amino acid residues of which about 60% of the amino acid 
residues contain non-polar side chains, for example, alanine, valine, leucine, isoleucine, 
5 proline, phenylalanine, tryptophan, and methionine. In a preferred embodiment, a 

"transmembrane domain" includes a protein domain having at least about 13, preferably 
about 16, more preferably about 19, and even more preferably about 21, 23, 25, 30, 35 or 
40 amino acid residues, of which at least about 70%, preferably about 80%, and more 
preferably about 90% of the amino acid residues contain non-polar side chains, for 
1 0 example, alanine, valine, leucine, isoleucine, proline, phenylalanine, tryptophan, and 

methionine. A transmembrane domain is lipophillic in nature, and spans a membrane of a 
cell (e.g., spans the cell membrane or an organelle membrane in a eukaryotic cell). Proteins 
q or polypeptides having a transmembrane domain (or transmembrane domains) are 

fU commonly referred to as "membrane-bound" proteins or polypeptides. A protein or 

1 5 polypeptide that "lacks a transmembrane domain" is conversly, not membrane-bound, for 



Iff example is cytosolic, secreted or exists within an organelle-millieu. 



in 



Preferred mGluRSM molecules of the present invention have an amino acid 



IU sequence sufficiently homologous or identical to the mGluRSM protein having the amino 



acid sequence of SEQ ID NO:2. As used herein, the term "sufficiently homologous" or 

20 "sufficiently identical" refers to a first amino acid or nucleotide sequence which contains a 
sufficient or minimum number of identical or equivalent (e.g., an amino acid residue which 
has a similar side chain) amino acid residues or nucleotides to a second amino acid or 
nucleotide sequence such that the first and second amino acid or nucleotide sequences share 
common structural domains and/or a common functional activity. For example, amino acid 

25 or nucleotide sequences which share common structural domains share at least about 60%, 
65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 
more identity across the amino acid sequences of the domains and contain at least one and 
preferably two structural domains, are defined herein as sufficiently homologous or 
identical. Furthermore, amino acid or nucleotide sequences which share at least about 

30 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
99% or more identity and share a common functional activity are defined herein as 
sufficiently homologous or identical. 

As used interchangeably herein, a "mGluR5M activity", "biological activity of 
mGluR5M" or "functional activity of mGluRSM", refers to an activity exerted by a 

35 mGluR5M protein, polypeptide or nucleic acid molecule, for example, an activity 
exerted on a mGluR-expressing cell, as determined in vivo, or in vitro, according to 
standard techniques. 
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In a preferred aspect on the invention, a "mGluRSM activity", "biological 
activity of mGluRSM" or "functional activity of mGluRSM" includes modulation (e.g., 
enhancement or inhibition) of glutamate receptor functions and/or activities, in 
particular, metabotropic glutamate receptor functions and/or activities (e.g., mGluR5 
5 functions and/or activities). In one embodiment, a mGluRSM protein of the present 
invention modulates glutamate receptor function and/or activity directly, for example, by 
binding to, enhancing or inhibiting receptor activity in a ligand dependent or 
independent manner. In another embodiment, a mGluR5M protein of the present 
invention modulates glutamate receptor function and/or activity indirectly, e.g., by 
10 influencing glutamate receptor ligand availability. For example, a mGluRSM protein 
can regulate neurotransmission, neurotoxicity, exitotoxicity and/or metabolic function 
by direct binding of glutamate. 

In a preferred embodiment, a mGluRSM activity is at least one of the following 
activities: (1) modulation of G protein linked second messenger signaling pathways 
fe ?! 15 (e.g. , modulation of diacylglycerol and/or inisitol triphosphate-mediated signaling 

Hi pathways), for example, signaling pathways involved in neuronal cell signalling and 

if - 

nervous system function; (2) modulation of glutamatergic transmission; (3) modulation 
jU of neuronal excitability; (4) regulation of synaptic transmission; (5) modulation of 

Irs I 

iff neurotransmitter release (e.g., glutamate release); (6) regulation of voltage-dependent 

20 and/or voltage-independent and/or ligand-gated ion channels (e.g. , K+ channels or Ca2+ 
channels); (7) regulation of neuronal development (e.g., regulation of neuronal 
differentiation, migration and/or survival in the developing brain); (8) modulatoin of 
nervous system function; and (9) modulation of neurodegenerative processes (e.g., acute 
or chronic neurodegenerative processes). In yet another embodiment, a mGluRSM 
25 activity is modulation of mGluR5 dimerization (e.g., mGluRSa and/or mGluR5b 
dimerization) and/or dimerization of other mGluR family members (e.g., mGluRl 
dimerization). 

Accordingly, the mGluRSM proteins of the present invention (as well as 
peptides, nucleic acid molecules, antibodies, peptidomimetics and small molecules) may 
30 be useful in treating, for example, neurodegenerative disorders and/or diseases (e.g., 

motor neuron disease (MND), amyotrophic lateral sclerosis (ALS), Huntington's chorea, 
Parkinson's disease and Alzheimer's disease), stoke, the brain damage occurring acutely 
after status epilepticus, cerebral ischemia or traumatic brain injury and/or movement 
disorders. The mGluRSM proteins of the present invention (as well as peptides, nucleic 
35 acid molecules, antibodies, peptidomimetics and small molecules) may also be useful as 
cognitive enhancers, anti epileptic agents and/or in the treatment of pain and/or 
hypertension. Other clinical conditions that may respond to treatment with the 
mGluRSM proteins of the present invention include epilepsy, amnesia, anxiety, 
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hyperalgesia and/or psychiatric disorders (e.g., schizophrenia and/or other psychoses). 
In a preferred, the mGluRSM proteins of the present invention (as well as peptides, 
nucleic acid molecules, antibodies, peptidomimetics and small molecule modulators) are 
useful in the treatment of shizophrenia and/or other psychiatric disorders including, but 
5 not limited to schizoaffective disorder, bipolar affective disorder, unipolar affective 
disorder and adolescent conduct disorder. 

A preferred embodiment of the invention features isolated mGluRSM proteins 
and polypeptides having a mGluRSM activity, as described herein. Preferred mGluRSM 
proteins have at least an N-terminal mGluR-like domain and/or a C-terminal unique 
1 0 domain and a mGluRSM activity. In a preferred embodiment, a mGluRSM protein has a 
G-protein coupled receptor family 31 consensus sequence and a mGluR5M activity. In 
another preferred embodiment, a mGluRSM protein has at least an N-terminal mGluR- 
p| like domain and/or a C-terminal unique domain and a G-protein coupled receptor family 

3_1 consensus sequence, a mGluRSM activity, and an amino acid sequence sufficiently 



ru 

1 5 homologous or identical to the amino acid sequence of SEQ ID NO:2 



til A human mGluRSM cDNA was identified from a library derived from human 

i i 

* w adult whole brain RNA, which is approximately 1 823 nucleotides in length, and which 

hh encodes a protein which is approximately 369 amino acid residues in length. The 

human mGluR5M protein contains an N-terminal mGluR-like domain at about amino 

i y 

i,I 20 acids 1 to 303 of SEQ ID NO:2 and contains a C-termnal unique domain at about amino 

O acids 304 to 369 of SEQ ID NO:2. The human mGiuR5M protein further contains a G- 

protein coupled receptor family 3 1 consensus seqeunce at about amino acids 158 to 176 

of SEQ ID NO:2. 

A plasmid containing the nucleotide sequence encoding human mGluSM 
25 (referred to interchangeably herein as YI176) was deposited with the American Type 
Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 201 10-2209, 
on December 12, 2000 and assigned Accession Number PTA-2775. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of 
the Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was 
30 made merely as a convenience for those of skill in the art and is not an admission that a 
deposit is required under 35 U.S.C. §112. 

Various aspects of the invention are described in further detail in the following 
subsections: 



35 



I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 
encode mGluRSM proteins or biologically active portions thereof, as well as nucleic 
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acid fragments sufficient for use as hybridization probes to identify mGluRSM-encoding 
nucleic acids (e.g., mGluR5M mRNA) and fragments for use as PCR primers for the 
amplification or mutation of mGluRSM nucleic acid molecules. As used herein, the 
term "nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or 
5 genomic DNA) and RNA molecules (e.g. , mRNA) and analogs of the DNA or RNA 
generated using nucleotide analogs. The nucleic acid molecule can be single-stranded or 
double-stranded, but preferably is double-stranded DNA. 

An "isolated" nucleic acid molecule is one which is separated from other nucleic 
acid molecules which are present in the natural source of the nucleic acid. Preferably, an 

1 0 "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (/. e. , 
sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated mGluRSM nucleic acid molecule can contain less than about 5 kb, 4kb, 3kb, 
2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic 

1 5 acid molecule in genomic DNA of the cell from which the nucleic acid is derived. 
Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture medium when produced by 
recombinant techniques, or substantially free of chemical precursors or other chemicals 
when chemically synthesized. 

20 A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 

having the nucleotide sequence of SEQ ID NO:l, the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number PTA-2775, or a 
portion thereof, can be isolated using standard molecular biology techniques and the 
sequence information provided herein. Using all or portion of the nucleic acid sequence 

25 of SEQ ID NO: 1 , or the nucleotide sequence of the DNA insert of the plasmid deposited 
with ATCC as Accession Number PTA-2775 as a hybridization probe, mGluRSM 
nucleic acid molecules can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 
Cloning; A Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring 

30 Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID 
NO:l, or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number PTA-2775 can be isolated by the polymerase chain 
reaction (PCR) using synthetic oligonucleotide primers designed based upon the 

35 sequence of SEQ ID NO: 1 , or the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number PTA-2775. 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
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according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to mGluR5M nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 
5 In a preferred embodiment, an isolated nucleic acid molecule of the invention 

comprises the nucleotide sequence shown in SEQ ID NO:L The sequence of SEQ ID 
NO:l corresponds to the human mGluR5M cDNA. This cDNA comprises sequences 
encoding the human mGluR5M protein {i.e., "the coding region", from nucleotides 4- 
1113, including a stop codon from nucleotides 1 1 10-1 1 13), as well as 5' untranslated 
10 sequences (nucleotides 1-3) and 3' untranslated sequences (nucleotides 1110 or 1113- 
1823). Alternatively, the nucleic acid molecule can comprise only the coding region of 
SEQ ID NO:l {e.g., nucleotides 4 to 1 1 10 or 1 1 13, corresponding to SEQ ID NO:3). 
In another preferred embodiment, an isolated nucleic acid molecule of the 
ffi invention comprises a nucleic acid molecule which is a complement of the nucleotide 

N 1 5 sequence shown in SEQ ID NO: 1 , the nucleotide sequence of the DNA insert of the 

Iff plasmid deposited with ATCC as Accession Number PTA-2775, or a portion of any of 

III these nucleotide sequences. A nucleic acid molecule which is complementary to the 

f s nucleotide sequence shown in SEQ ID NO: 1 , or the nucleotide sequence of the DNA 

Pl j insert of the plasmid deposited with ATCC as Accession Number PTA-2775 is one 

20 which is sufficiently complementary to the nucleotide sequence shown in SEQ ID NO:l , 
PI or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 

M Accession Number PTA-2775, thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 
present invention comprises a nucleotide sequence which is at least about 60%, 65%, 
25 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more 
identical to the nucleotide sequences shown in SEQ ID NO:l, or the nucleotide sequence 
of the DNA insert of the plasmid deposited with ATCC as Accession Number PTA- 
2775, or a portion of any of these nucleotide sequences. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
30 of the nucleic acid sequence of SEQ ID NO:l, or the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number PTA-2775, for 
example a fragment which can be used as a probe or primer or a fragment encoding a 
biologically active portion of a mGluRSM protein. The nucleotide sequence determined 
from the cloning of the mGluR5M genes allows for the generation of probes and primers 
35 designed for use in identifying and/or cloning other mGluRSM family members, as well 
as mGluR5M homologues from other species. The probe/primer typically comprises 
substantially purified oligonucleotide. The probe/primer {e.g., oligonucleotide) typically 
comprises about 5-10, 10-15, 15-20, 20-25 or more nucleotides that hybridize under 
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5s 



stringent conditions to a sense sequence of SEQ ID NO:l or the nucleotide sequence of 
the DNA insert of the plasmid deposited with ATCC as Accession Number PTA-2775, 
to an anti-sense sequence of SEQ ID NO:l or the nucleotide sequence of the DNA insert 
of the plasmid deposited with ATCC as Accession Number PTA-2775, or to a naturally 
5 occurring mutant of SEQ ID NO: 1 . Alternatively, the probe/primer (e.g. , 

oligonucleotide) comprises about 5-10, 10-15, 15-20, 20-25 or more consecutive 
nucleotides of a sense sequence of SEQ ID NO:l or the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number PTA-2775, of an anti- 
sense sequence of SEQ ID NO: 1 or the nucleotide sequence of the DNA insert of the 
1 0 plasmid deposited with ATCC as Accession Number PTA-2775, or of a naturally 
occurring mutant of SEQ ID NO: 1 . An exemplary probe/primer for detection of 
pi mGluRSM seqeunces comprises nucleotides of a sense or antisense sequence of SEQ ID 

HI NO: 1 from about nucleotides 880 to 1 823. 

Probes based on the mGluR5M nucleotide sequences can be used to detect 
1 5 transcripts or genomic sequences encoding the same or homologous proteins. In 
J preferred embodiments, the probe further comprises a label group attached thereto, e.g. , 

the label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme 
co-factor. Such probes can be used as a part of a diagnostic test kit for identifying cells 
pi or tissue which misexpress a mGluRSM protein, such as by measuring a level of a 

M 20 mGluRSM-encoding nucleic acid in a sample of cells from a subject e.g. 9 detecting 

mGluR5M mRNA levels or determining whether a genomic mGluR5M gene has been 
mutated or deleted. 

A nucleic acid fragment encoding a "biologically active portion of a mGluR5M 
protein" can be prepared by isolating a portion of the nucleotide sequence of SEQ ID 
25 NO:l, or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number PTA-2775, which encodes a polypeptide having a 
mGluRSM biological activity (the biological activities of the mGluR5M proteins have 
previously been described), expressing the encoded portion of the mGluRSM protein 
(e.g. , by recombinant expression in vitro) and assessing the activity of the encoded 
30 portion of the mGluRSM protein. In an exemplary embodiment, a nucleic acid molecule 
encoding a biologically active portion of a protein of the present invention comprises a 
nucleotide sequence which is greater that 100, 200, 300, 400, 500, 600, 700, 800, 900, 
1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or more nucleotides in length 
and hybridizes under stringent hybridization conditions to a complement of the nucleic 
35 acid molecule of SEQ ID NO: 1 or the nucleotide sequence of the DNA insert of the 
plasmid deposited with ATCC as Accession Number PTA-2775. 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO:l or the nucleotide sequence of the DNA 
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insert of the plasmid deposited with ATCC as Accession Number PTA-2775, due to 
degeneracy of the genetic code and thus encode the same mGluR5M proteins as those 
encoded by the nucleotide sequence shown in SEQ ID NO:l . In another embodiment, 
an isolated nucleic acid molecule of the invention has a nucleotide sequence encoding a 
5 protein having an amino acid sequence shown in SEQ ID NO:2. 

In addition to the mGluRSM nucleotide sequences shown in SEQ ID NO:l, or 
the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number PTA-2775, it will be appreciated by those skilled in the art that DNA 
sequence polymorphisms that lead to changes in the amino acid sequences of the 
] 0 mGluR5M proteins may exist within a population (e.g. , the human population). Such 
genetic polymorphism in the mGluRSM genes may exist among individuals within a 
population due to natural allelic variation. As used herein, the terms "gene" and 
"recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
f II encoding a mGluRSM protein, preferably a mammalia mGluR5M protein. Such natural 

1 5 allelic variations can typically result in 1-5% variance in the nucleotide sequence of a 
mGluRSM gene. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in mGluR5M genes that are the result of natural allelic variation and that 
do not alter the functional activity of a mGluR5M protein are intended to be within the 
scope of the invention. 

20 Moreover, nucleic acid molecules encoding other mGluR5M family members 

and thus which have a nucleotide sequence which differs from the mGluR5M sequence 
of SEQ ID NO:l are intended to be within the scope of the invention. For example, a 
primate mGluRSM cDNA can be identified based on the nucleotide sequence of human 
mGluR5M. Nucleic acid molecules encoding mGluRSM proteins from different 
25 species, and thus which have a nucleotide sequence which differs from the mGluR5M 
sequences of SEQ ID NO:l are intended to be within the scope of the invention. 

Nucleic acid molecules corresponding to natural allelic variants and homologues 
of the mGluRSM cDNAs of the invention can be isolated based on their homology to the 
mGluRSM nucleic acids disclosed herein using the cDNAs disclosed herein, or a portion 
30 thereof, as a hybridization probe according to standard hybridization techniques under 
stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 15 nucleotides in length and hybridizes under stringent conditions to 
a complement of the nucleic acid molecule comprising the nucleotide sequence of SEQ 
35 ID NO:l or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number PTA-2775. In another embodiment, the nucleic acid is at 
least 30, 50, 100, 250 or 500 nucleotides in length and hybridizes under stringent 
conditions to a complement of a nucleic acid molecule comprising the nucleotide 
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sequence of SEQ ID NO:l or the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number PTA-2775. 

As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences 
5 that are significantly identical or homologous to each other remain hybridized to each 
other. Preferably, the conditions are such that sequences at least about 70%, more 
preferably at least about 80%, even more preferably at least about 85% or 90% identical 
to each other remain hybridized to each other. Such stringent conditions are known to 
those skilled in the art and can be found in Current Protocols in Molecular Biology, 
10 Ausubel et al, eds., John Wiley & Sons, Inc. (1995), sections 2, 4 and 6. Additional 
stringent conditions can be found in Molecular Cloning: A Laboratory Manual, 
Sambrook et al, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), chapters 7, 

CI 9 and 1 1 . A preferred, non-limiting example of stringent hybridization conditions 

PI I 

V\ includes hybridization in 4X sodium chloride/sodium citrate (SSC), at about 65-70°C (or 

ly 1 5 hybridization in 4X SSC plus 50% fbrmamide at about 42-50°C) followed by one or 
p 5 : more washes in IX SSC, at about 65-70°C. A preferred, non-limiting example of highly 

* stringent hybridization conditions includes hybridization in IX SSC, at about 65-70°C 

H u (or hybridization in IX SSC plus 50% formamide at about 42-50°C) followed by one or 

ft i 

more washes in 0.3X SSC, at about 65-70°C. A preferred, non-limiting example of 
H 20 reduced stringency hybridization conditions includes hybridization in 4X SSC, at about 

|*f 50-60°C (or alternatively hybridization in 6X SSC plus 50% formamide at about 40-45° 

C) followed by one or more washes in 2X SSC, at about 50-60°C. Ranges intermediate 
to the above-recited values, e.g., at 65 -70°C or at 42-5 0°C are also intended to be 
encompassed by the present invention. SSPE (lxSSPE is 0.1 5M NaCl, lOmM 
25 NaH 2 P0 4 , and 1 .25mM EDTA, pH 7.4) can be substituted for SSC (IxSSC is 0. 1 5M 
NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 1 5 minutes each after hybridization is complete. The hybridization 
temperature for hybrids anticipated to be less than 50 base pairs in length should be 5- 
10°C less than the melting temperature (T m ) of the hybrid, where T m is determined 
30 according to the following equations. For hybrids less than 18 base pairs in length, 
T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 49 
base pairs in length, T m (°C) = 81.5 + 16.6(log 10 [Na + ]) + 0.41 (%G+C) - (600/N), where N 
is the number of bases in the hybrid, and [Na + ] is the concentration of sodium ions in the 
hybridization buffer ([Na + ] for IxSSC = 0.165 M). It will also be recognized by the 
35 skilled practitioner that additional reagents may be added to hybridization and/or wash 
buffers to decrease non-specific hybridization of nucleic acid molecules to membranes, 
for example, nitrocellulose or nylon membranes, including but not limited to blocking 
agents (e.g., BSA or salmon or herring sperm carrier DNA), detergents (e.g., SDS), 
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chelating agents (e.g., EDTA), Ficoll, PVP and the like. When using nylon membranes, 
in particular, an additional preferred, non-limiting example of stringent hybridization 
conditions is hybridization in 0.25-0.5M NaH 2 P0 4 , 7% SDS at about 65°C, followed by 
one or more washes at 0.02M NaH 2 P0 4 , 1% SDS at 65°C, see e.g., Church and Gilbert 
5 (1984) Proc. Natl Acad. Sci. USA 81:1991-1995, (or alternatively 0.2X SSC, 1% SDS). 
Preferably, an isolated nucleic acid molecule of the invention that hybridizes 
under stringent conditions to a complement of the sequence of SEQ ID NO:l or 3 and 
corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
10 a nucleotide sequence that occurs in nature (e.g., encodes a natural polypeptide). 

In addition to naturally-occurring allelic variants of the mGluRSM sequences 
H that may exist in the population, the skilled artisan will further appreciate that changes 

'fl can be introduced by mutation into the nucleotide sequences of SEQ ID NO: 1 or the 

III nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 

I: 1 5 Accession Number PTA-2775, thereby leading to changes in the amino acid sequence of 

fi J the encoded mGluRSM proteins, without altering the functional ability of the mGluR5M 

w proteins. For example, nucleotide substitutions leading to amino acid substitutions at 

m "non-essential" amino acid residues can be made in the sequence of SEQ ID NO:l or the 

fll nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 

fT 20 Accession Number PTA-2775. A "non-essential" amino acid residue is a residue that 

can be altered from the wild-type sequence of mGluR5M (e.g., the sequence of SEQ ID 
NO:2) without altering the biological activity, whereas an "essential" amino acid residue 
is required for biological activity. For example, amino acid residues that are conserved 
among the mGluR5M proteins of the present invention and mGluRS proteins and 
25 asterisked in Figure 2 are predicted to be particularly unamenable to alteration. 

Moreover, amino acid residues that are defined by the G protein coupled receptor family 
3 1 consensus sequence are particularly unamenable to alteration. Furthermore, 
additional amino acid residues that are conserved between the mGluRSM proteins of the 
present invention and other members of the G protein coupled receptor protein family 
30 (e.g., the mGlu5R proteins set forth in Figure 2) are not likely to be amenable to 
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding mGluR5M proteins that contain changes in amino acid residues that are not 
essential for activity. Such mGluR5M proteins differ in amino acid sequence from SEQ 
35 ID NO:2 yet retain biological activity. In one embodiment, the isolated nucleic acid 
molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an amino acid sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 



1 ; 
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90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to SEQ ID 
NO:2. 

An isolated nucleic acid molecule encoding a mGluRSM protein homologous to 
the protein of SEQ ID NO:2 can be created by introducing one or more nucleotide 
5 substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO: 1 , such 
that one or more amino acid substitutions, additions or deletions are introduced into the 
encoded protein. Mutations can be introduced into SEQ ID NO:l by standard 
techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are made at one or more predicted 

10 non-essential amino acid residues. A "conservative amino acid substitution" is one in 
which the amino acid residue is replaced with an amino acid residue having a similar 
side chain. Families of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids with basic side chains (e.g. , 
lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), 

1 5 uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, 
tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted nonessential amino acid residue in a mGluRSM 

20 protein is preferably replaced with another amino acid residue from the same side chain 
family. Alternatively, in another embodiment, mutations can be introduced randomly 
along all or part of a mGluR5M coding sequence, such as by saturation mutagenesis, and 
the resultant mutants can be screened for mGluR5M biological activity to identify 
mutants that retain activity. Following mutagenesis of SEQ ID NO:l, or the encoded 

25 protein can be expressed recombinantly and the activity of the protein can be 
determined. 

In a preferred embodiment, a mutant mGluRSM protein can be assayed for the 
ability to (1) modulate G protein linked second messenger signaling pathways (e.g., 
modulate diacylglycerol and/or inisitol triphosphate-mediated signaling pathways); (2) 

30 modulate glutamatergic transmission; (3) modulate neuronal excitability; (4) regulate 
synaptic transmission; (5) modulate neurotransmitter release (e.g., glutamate release); 
(6) regulate voltage-dependent and/or voltage-independent and/or ligand-gated ion 
channels (e.g., K+ channels or Ca2+ channels); (7) regulate neuronal development (e.g., 
regulate neuronal differentiation, migration and/or survival in the developing brain); (8) 

35 modulate neurodegenerative processes (e.g., acute or chronic neurodegenerative 

processes); and/or (9) modulate mGluR5 dimerization (e.g., mGluR5a and/or mGluR5b 
dimerization). 
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In addition to the nucleic acid molecules encoding mGluRSM proteins described 
above, another aspect of the invention pertains to isolated nucleic acid molecules which 
are antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence 
which is complementary to a "sense" nucleic acid encoding a protein, e.g., 
5 complementary to the coding strand of a double-stranded cDNA molecule or 

complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be 
complementary to an entire mGluRSM coding strand, or to only a portion thereof. In 
one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of 
10 the coding strand of a nucleotide sequence encoding mGluR5M. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues {e.g., the coding region of human mGluRSM 
:;| corresponds to SEQ ID NO:3). In another embodiment, the antisense nucleic acid 

molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
15 sequence encoding mGluRSM. The term "noncoding region" refers to 5' and 3' 
f U sequences which flank the coding region that are not translated into amino acids (i. e. , 

also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding mGluR5M disclosed herein (e.g., 
SEQ ID NO:3), antisense nucleic acids of the invention can be designed according to the 
20 rules of Watson and Crick base pairing. The antisense nucleic acid molecule can be 

complementary to the entire coding region of mGluRSM mRNA, but more preferably is 
an oligonucleotide which is antisense to only a portion of the coding or noncoding 
region of mGluRSM mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation start site of mGluRSM mRNA. 
25 An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 
or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis and enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
30 variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
35 chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- 

(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-24hiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1 -methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2- 



ru 
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methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5 r -methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
5 queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 

methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
Alternatively, the antisense nucleic acid can be produced biologically using an 
expression vector into which a nucleic acid has been subcloned in an antisense 
10 orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 

1 5 and/or genomic DNA encoding a mGluR5M protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. An example of a route of 

20 administration of antisense nucleic acid molecules of the invention include direct 

injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 
administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense 

25 nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
placed under the control of a strong pol II or pol III promoter are preferred. 

30 In yet another embodiment, the antisense nucleic acid molecule of the invention 

is an a-anomeric nucleic acid molecule. An oc-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual p-units, the strands run parallel to each other (Gaultier et al (1987) Nucleic Acids. 
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o- 

35 methylribonucleotide (Inoue et al (1987) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al (1987) FEBS Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
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capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave mGluR5M mRNA transcripts to thereby inhibit translation of 
5 mGluR5M mRNA. A ribozyme having specificity for a mGluRSM-encoding nucleic 
acid can be designed based upon the nucleotide sequence of a mGluRSM cDNA 
disclosed herein (i.e., SEQ ID NO:l). For example, a derivative of a Tetrahymena L-19 
IVS RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in a mGluR5M-encoding 
10 mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent 
No. 5,1 16,742. Alternatively, mGluR5M mRNA can be used to select a catalytic RNA 

H having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel, 

g D. and Szostak, J.W. (1993) Science 261:141 1-1418. 

f\\ Alternatively, mGluR5M gene expression can be inhibited by targeting 

1 5 nucleotide sequences complementary to the regulatory region of the mGluR5M (e.g. , the 
f y mGluRSM promoter and/or enhancers) to form triple helical structures that prevent 

lu transcription of the mGluR5M gene in target cells. See generally, Helene, C. (1991) 

L Anticancer Drug Des. 6(6):569-84; Helene, C. et al. (1 992) Ann. N. Y. Acad. Sci 660:27- 

111 36; and Maher, L.J. (1992) Bioassays 14(12):807-15. 

? *f 20 In yet another embodiment, the mGluR5M nucleic acid molecules of the present 

r g invention can be modified at the base moiety, sugar moiety or phosphate backbone to 

H improve, e.g., the stability, hybridization, or solubility of the molecule. For example, 

the deoxyribose phosphate backbone of the nucleic acid molecules can be modified to 
generate peptide nucleic acids (see Hyrup B. et al (1996) Bioorganic & Medicinal 
25 Chemistry 4 (1): 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" 
refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate 
backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases 
are retained. The neutral backbone of PNAs has been shown to allow for specific 
hybridization to DNA and RNA under conditions of low ionic strength. The synthesis 
30 of PNA oligomers can be performed using standard solid phase peptide synthesis 

protocols as described in Hyrup B. et al (1996) supra; Perry-O'Keefe et al PNAS 93: 
14670-675. 

PNAs of mGluR5M nucleic acid molecules can be used in therapeutic and 
diagnostic applications. For example, PNAs can be used as antisense or antigene agents 
35 for sequence-specific modulation of gene expression by, for example, inducing 

transcription or translation arrest or inhibiting replication. PNAs of mGluRSM nucleic 
acid molecules can also be used in the analysis of single base pair mutations in a gene, 
(e.g., by PNA-directed PCR clamping); as 'artificial restriction enzymes' when used in 



-21 - 



ATTORNEY DOCKET NO.: GNN-024 

combination with other enzymes, (e.g., SI nucleases (Hyrup B. (1996) supra)); or as 
probes or primers for DNA sequencing or hybridization (Hyrup B. et al (1996) supra; 
Perry-O'Keefe supra). 

In another embodiment, PNAs of mGluR5M can be modified, (e.g., to enhance 
5 their stability or cellular uptake), by attaching lipophilic or other helper groups to PNA, 
by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques 
of drug delivery known in the art. For example, PNA-DNA chimeras of mGluRSM 
nucleic acid molecules can be generated which may combine the advantageous 
properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, (e.g., 
10 RNAse H and DNA polymerases), to interact with the DNA portion while the PNA 

portion would provide high binding affinity and specificity. PNA-DNA chimeras can be 
^ linked using linkers of appropriate lengths selected in terms of base stacking, number of 

Q bonds between the nucleobases, and orientation (Hyrup B. (1 996) supra). The synthesis 

! of PNA-DNA chimeras can be performed as described in Hyrup B. (1996) supra and 

J 15 Finn P.J. et al (1996) Nucleic Acids Res. 24 (17): 3357-63. For example, a DNA chain 

| can be synthesized on a solid support using standard phosphoramidite coupling 

W 

B chemistry and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy- 

thymidine phosphoramidite, can be used as a between the PNA and the 5 r end of DNA 
(Mag, M. et al (1989) Nucleic Acid Res. 17: 5973-88). PNA monomers are then 
20 coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment 
and a 3 ? DNA segment (Finn P. J. et al (1996) supra). Alternatively, chimeric molecules 
can be synthesized with a 5' DNA segment and a 3' PNA segment (Petersen K.H. et al 
(1975) Bioorganic Med. Chem. Lett. 5: 1119-11124). 

In other embodiments, the oligonucleotide may include other appended groups 
25 such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al (1989) Proc. Natl Acad. 
Sci. US. 86:6553-6556; Lemaitre et al (1987) Proc. Natl. Acad. Sci. USA 84:648-652; 
PCT Publication No. W088/09810, published December 15, 1988) or the blood-brain 
barrier (see, e.g., PCT Publication No. W089/10134, published April 25, 1988). In 
30 addition, oligonucleotides can be modified with hybridization-triggered cleavage agents 
(See, e.g., Krol et al (1988) BioTechniques 6:958-976) or intercalating agents. (See, 
e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking 
agent, transport agent, or hybridization-triggered cleavage agent). 



35 



II. Isolated mGluR5M Proteins and Anti-mGluR5M Antibodies 

One aspect of the invention pertains to isolated mGluR5M proteins, and 
biologically active portions thereof, as well as polypeptide fragments suitable for use as 
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immunogens to raise anti-mGluR5M antibodies. In one embodiment, native mGluRSM 
proteins can be isolated from cells or tissue sources by an appropriate purification 
scheme using standard protein purification techniques. In another embodiment, 
mGluRSM proteins are produced by recombinant DNA techniques. Alternative to 
5 recombinant expression, a mGluRSM protein or polypeptide can be synthesized 
chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the mGluRSM protein is derived, or substantially free from 
10 chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of mGluRSM protein in 
which the protein is separated from cellular components of the cells from which it is 
(3 isolated or recombinantly produced. In one embodiment, the language "substantially 

free of cellular material" includes preparations of mGluRSM protein having less than 
1 5 about 30% (by dry weight) of non-mGluRSM protein (also referred to herein as a 
"contaminating protein"), more preferably less than about 20% of non-mGluRSM 
protein, still more preferably less than about 10% of non-mGluR5M protein, and most 
preferably less than about 5% non-mGluRSM protein. When the mGluR5M protein or 
biologically active portion thereof is recombinantly produced, it is also preferably 
20 substantially free of culture medium, i.e. , culture medium represents less than about 
20%o, more preferably less than about 10%, and most preferably less than about 5% of 
the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of mGluRSM protein in which the protein is separated from 
25 chemical precursors or other chemicals which are involved in the synthesis of the 

protein. In one embodiment, the language "substantially free of chemical precursors or 
other chemicals" includes preparations of mGluRSM protein having less than about 30% 
(by dry weight) of chemical precursors or non-mGluRSM chemicals, more preferably 
less than about 20% chemical precursors or non-mGluRSM chemicals, still more 
30 preferably less than about 10% chemical precursors or non-mGluRSM chemicals, and 
most preferably less than about 5% chemical precursors or non-mGluR5M chemicals. 

Biologically active portions of a mGluRSM protein include peptides comprising 
amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the mGluRSM protein, e.g., the amino acid sequence shown in SEQ ID 
35 NO:2, which include less amino acids than the full length mGluRSM proteins, and 
exhibit at least one activity of a mGluRSM protein. Typically, biologically active 
portions comprise a domain or motif with at least one activity of the mGluRSM protein. 
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A biologically active portion of a mGluRSM protein can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acids in length. 

In one embodiment, a biologically active portion of a mGluRSM protein 
comprises at least an N-terminal mGluR-like domain and/or a C-terminal unique 
5 domain. In another embodiment, a biologically active portion of a mGluR5M protein 
comprises at least one G protein coupled receptor family 31 consensus sequence. 

It is to be understood that a preferred biologically active portion of a mGluR5M 
protein of the present invention may contain at least one of the above-identified 
structural domains and/or profiles. A more preferred biologically active portion of a 
10 mGluR5M protein may contain at least two of the above-identified structural domains 
and/or profiles. Moreover, other biologically active portions, in which other regions of 
the protein are deleted, can be prepared by recombinant techniques and evaluated for one 
or more of the functional activities of a native mGluR5M protein. 

In a preferred embodiment, the mGluR5M protein has an amino acid sequence 
1 5 shown in SEQ ID NO:2. In other embodiments, the mGluR5M protein is substantially 
homologous or identical to SEQ ID NO:2, and retains the functional activity of the 
protein of SEQ ID NO:2, yet differs in amino acid sequence due to natural allelic 
variation or mutagenesis, as described in detail in subsection I above. Accordingly, in 
another embodiment, the mGluR5M protein is a protein which comprises an amino acid 
20 sequence at least about 60% identical to the amino acid sequence of SEQ ID NO:2 and 
retains the functional activity of the mGluRSM proteins of SEQ ID NO:2, respectively. 
Preferably, the protein is at least about 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to the amino acid sequence of 
SEQ ID NO:2 and retains the functional activity of the mGluR5M proteins of SEQ ID 
25 NO:2. 

To determine the percent identity of two amino acid sequences or of two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-identical sequences can be disregarded for 
30 comparison purposes). In a preferred embodiment, the length of a reference sequence 
aligned for comparison purposes is at least 30%), preferably at least 40%, more 
preferably at least 50%, even more preferably at least 60%, and even more preferably at 
least 70%>, 80%, or 90% of the length of the reference sequence (e.g., when aligning a 
second sequence to the mGluR5M amino acid sequence of SEQ ID NO:2 having 369 
35 amino acid residues, at least 111, preferably at least 148, more preferably at least 185, 
even more preferably at least 221, and even more preferably at least 258, 295 or 332 
amino acid residues are aligned). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
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position in the first sequence is occupied by the same amino acid residue or nucleotide 
as the corresponding position in the second sequence, then the molecules are identical at 
that position (as used herein amino acid or nucleic acid "identity" is equivalent to amino 
acid or nucleic acid "homology"). The percent identity between the two sequences is a 
5 function of the number of identical positions shared by the sequences, taking into 

account the number of gaps, and the length of each gap, which need to be introduced for 
optimal alignment of the two sequences. 

The comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. In a preferred 
10 embodiment, the percent identity between two amino acid sequences is determined using 
the Needleman and Wunsch {J. Mol Biol (48):444-453 (1970)) algorithm which has 
been incorporated into the GAP program in the GCG software package (available at 
http://www.gcg.com), using either a Blosum 62 matrix or a PAM250 matrix, and a gap 
weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet 
15 another preferred embodiment, the percent identity between two nucleotide sequences is 
determined using the GAP program in the GCG software package (available at 
http://www.gcg.com), using aNWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 
70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A preferred, non-limiting example of 
parameters to be used in conjunction with the GAP program include a Blosum 62 
H 20 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap 

H penalty of 5. 

In another embodiment, the percent identity between two amino acid or 
nucleotide sequences is determined using the algorithm of E. Meyers and W. Miller 
(Comput. Appl BioscL, 4:1 1-17 (1988)) which has been incorporated into the ALIGN 
25 program (version 2.0 or version 2.0U), using a PAM120 weight residue table, a gap 
length penalty of 12 and a gap penalty of 4. 

The nucleic acid and polypeptide sequences of the present invention can further 
be used as a "query sequence" to perform a search against public databases to, for 
example, identify other family members or related sequences. Such searches can be 
30 performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al 
(1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to mGluRSM nucleic acid molecules of the invention. BLAST protein 
searches can be performed with the XBLAST program, score = 100, wordlength = 3, and 
35 a Blosum62 matrix to obtain amino acid sequences homologous to mGluR5M 

polypeptide molecules of the invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in Altschul et al , (1 997) Nucleic 
Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, 
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the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. 

Proteins and protein fragments of the present invention include proteins with 
amino acid sequence lengths that are at least 25% (more preferably at least 50%, and 
5 even more preferably preferably at least 75%) of the length of a disclosed protein and 
have at least 60-70% sequence identity (more preferably, at least 70-75% identity and 
even more preferably at least 75-80%, 80-85%, 85-90%, 90-95% or more identity) with 
that disclosed protein, where sequence identity is determined as described herein. 

The invention also provides mGluR5M chimeric or fusion proteins. As used 
10 herein, a mGluRSM "chimeric protein" or "fusion protein" comprises a mGluR5M 
polypeptide operatively linked to a non-mGluR5M polypeptide. A "mGluR5M 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to 
mGluR5M, whereas a "non-mGluR5M polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to a protein which is not substantially homologous 
15 to the mGluR5M protein, e.g. , a protein which is different from the mGluRSM protein 
and which is derived from the same or a different organism. Within a mGluRSM fusion 
protein the mGluR5M polypeptide can correspond to all or a portion of a mGluRSM 
protein. In a preferred embodiment, a mGluR5M fusion protein comprises at least one 
biologically active portion of a mGluR5M protein. In another preferred embodiment, a 
20 mGluRSM fusion protein comprises at least two biologically active portions of a 

mGluRSM protein. Within the fusion protein, the term "operatively linked" is intended 
to indicate that the mGluRSM polypeptide and the non-mGluR5M polypeptide are fused 
in-frame to each other. The non-mGluR5M polypeptide can be fused to the N-terminus 
or C-terminus of the mGluRSM polypeptide. 
25 For example, in one embodiment, the fusion protein is a GST-mGluR5M fusion 

protein in which the mGluRSM sequences are fused to the C-terminus of the GST 
sequences. Such fusion proteins can facilitate the purification of recombinant 
mGluRSM. In another embodiment, the fusion protein is a mGluRSM protein 
containing a heterologous signal sequence at its N-terminus. In certain host cells (e.g., 
30 mammalian host cells), expression and/or secretion of mGluR5M can be increased 
through use of a heterologous signal sequence. 

The mGluRSM fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject in vivo. The mGluRSM 
fusion proteins can be used to affect the bioavailability of a mGluR5M substrate. Use of 
35 mGluRSM fusion proteins may be useful therapeutically for the treatment of central 
nervous system disorders. Moreover, the mGluRSM-fusion proteins of the invention 
can be used as immunogens to produce anti-mGluR5M antibodies in a subject, to purify 



O 

:= = r, 
Hi 

%J 

fii 
W 



-26- 



ATTORNEY DOCKET NO.: GNN-024 



mGluRSM ligands and in screening assays to identify molecules which inhibit the 
interaction of mGluR5M with a mGluR5M ligand or with a mGluR. 

Preferably, a mGluRSM chimeric or fusion protein of the invention is produced 
by standard recombinant DNA techniques. For example, DNA fragments coding for the 
5 different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques, for example by employing blunt-ended or stagger-ended 
termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene 
10 can be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and reamplified to generate a chimeric 
fU gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel 

15 et al John Wiley & Sons: 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A mGluRSM- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
moiety is linked in-frame to the mGluRSM protein. 

The present invention also pertains to variants of the mGluRSM proteins which 
20 function as either mGluRSM agonists (mimetics) or as mGluR5M antagonists. Variants 
of the mGluR5M proteins can be generated by mutagenesis, e.g., discrete point mutation 
or truncation of a mGluRSM protein. An agonist of the mGluRSM proteins can retain 
substantially the same, or a subset, of the biological activities of the naturally occurring 
form of a mGluRSM protein. An antagonist of a mGluRSM protein can inhibit one or 
25 more of the activities of the naturally occurring form of the mGluRSM protein by, for 
example, competitively inhibiting the protease activity of a mGluRSM protein. Thus, 
specific biological effects can be elicited by treatment with a variant of limited function. 
In one embodiment, treatment of a subject with a variant having a subset of the 
biological activities of the naturally occurring form of the protein has fewer side effects 
30 in a subject relative to treatment with the naturally occurring form of the mGluRSM 
protein. 

In one embodiment, variants of a mGluRSM protein which function as either 
mGluR5M agonists (mimetics) or as mGluRSM antagonists can be identified by 
screening combinatorial libraries of mutants, e.g., truncation mutants, of a mGluRSM 
35 protein for mGluRSM protein agonist or antagonist activity. In one embodiment, a 
variegated library of mGluRSM variants is generated by combinatorial mutagenesis at 
the nucleic acid level and is encoded by a variegated gene library. A variegated library 
of mGluRSM variants can be produced by, for example, enzymatically ligating a 
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mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of 
potential mGluRSM sequences is expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set 
of mGluR5M sequences therein. There are a variety of methods which can be used to 
5 produce libraries of potential mGluRSM variants from a degenerate oligonucleotide 
sequence. Chemical synthesis of a degenerate gene sequence can be performed in an 
automatic DNA synthesizer, and the synthetic gene then ligated into an appropriate 
expression vector. Use of a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential mGluR5M 
1 0 sequences. Methods for synthesizing degenerate oligonucleotides are known in the art 
(see, e.g., Narang, S.A. (1983) Tetrahedron 39:3; Itakura^tf/. (1984) Annu. Rev. 
Biochem. 53:323; Itakura etal (1984) Science 198:1056; Ike et al (1983) Nucleic Acid 
U Res. 11:477. 

til In addition, libraries of fragments of a mGluR5M protein coding sequence can be 

^ 15 used to generate a variegated population of mGluRSM fragments for screening and 

f|j subsequent selection of variants of a mGluRSM protein. In one embodiment, a library 

of coding sequence fragments can be generated by treating a double stranded PCR 
M fragment of a mGluRSM coding sequence with a nuclease under conditions wherein 

iy nicking occurs only about once per molecule, denaturing the double stranded DNA, 

fli 

i : 20 renaturing the DNA to form double stranded DNA which can include sense/antisense 

Cj pairs from different nicked products, removing single stranded portions from reformed 

duplexes by treatment with SI nuclease, and ligating the resulting fragment library into 
an expression vector. By this method, an expression library can be derived which 
encodes N-terminal, C -terminal and internal fragments of various sizes of the mGluRSM 
25 protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of 

30 mGluRSM proteins. The most widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries typically include cloning the 
gene library into replicable expression vectors, transforming appropriate cells with the 
resulting library of vectors, and expressing the combinatorial genes under conditions in 
which detection of a desired activity facilitates isolation of the vector encoding the gene 

35 whose product was detected. Recrusive ensemble mutagenesis (REM), a new technique 
which enhances the frequency of functional mutants in the libraries, can be used in 
combination with the screening assays to identify mGluRSM variants (Arkin and 
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Yourvan (1992) PNAS 59:781 1-7815; Delgrave et al (1993) Protein Engineering 
6(3):327-331). 

In one embodiment, cell based assays can be exploited to analyze a variegated 
mGluR5M library. For example, a library of expression vectors can be transfected into a 
5 cell line which ordinarily synthesizes mGluRSM. The transfected cells are then cultured 
such that a particular mutant mGluR5M is expressed and the effect of expression of the 
mutant on mGluRSM activity in the cell can be detected, e.g., by any of a number of 
activity assays for native mGluRSM protein. Plasmid DNA can then be recovered from 
the cells which score for modulated mGluR5M activity, and the individual clones further 
1 0 characterized. 

An isolated mGluR5M protein, or a portion or fragment thereof, can be used as 
M an immunogen to generate antibodies that bind mGluR5M using standard techniques for 

l;l polyclonal and monoclonal antibody preparation. A full-length mGluR5M protein can 

f || be used or, alternatively, the invention provides antigenic peptide fragments of 

1 5 mGluRSM for use as immunogens. The antigenic peptide of mGluR5M comprises at 
least 8 amino acid residues of the amino acid sequence shown in SEQ ID NO:2 and 
W encompasses an epitope of mGluRSM such that an antibody raised against the peptide 

[ . forms a specific immune complex with mGluR5M. Preferably, the antigenic peptide 

FU comprises at least 10 amino acid residues, more preferably at least 15 amino acid 

j! *f 20 residues, even more preferably at least 20 amino acid residues, and most preferably at 

O least 30 amino acid residues. 

^ Preferred epitopes encompassed by the antigenic peptide are regions of 

mGluRSM that are unique to the mGluRSM protein (e.g., within the C-terminal unique 
domain from about amino acids 304 to 369 of SEQ ID NO:2). 

25 A mGluR5M immunogen typically is used to prepare antibodies by immunizing 

a suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An 
appropriate immunogenic preparation can contain, for example, recombinantly 
expressed mGluR5M protein or a chemically synthesized mGluRSM polypeptide. The 
preparation can further include an adjuvant, such as Freund's complete or incomplete 

30 adjuvant, or similar immuno stimulatory agent. Immunization of a suitable subject with 
an immunogenic mGluRSM preparation induces a polyclonal anti-mGluR5M antibody 
response. 

Accordingly, another aspect of the invention pertains to anti-mGluRSM 
antibodies. The term "antibody" as used herein refers to immunoglobulin molecules and 
35 immunologically active portions of immunoglobulin molecules, i.e. , molecules that 

contain an antigen binding site which specifically binds (immunoreacts with) an antigen, 
such as mGluR5M. Examples of immunologically active portions of immunoglobulin 
molecules include F(ab) and F(ab') 2 fragments which can be generated by treating the 
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antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind mGluR5M. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
5 immunoreacting with a particular epitope of mGluRSM. A monoclonal antibody 

composition thus typically displays a single binding affinity for a particular mGluRSM 
protein with which it immunoreacts. 

Polyclonal anti-mGluR5M antibodies can be prepared as described above by 
immunizing a suitable subject with a mGluR5M immunogen. The anti-mGluR5M 
10 antibody titer in the immunized subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using 
immobilized mGluRSM. If desired, the antibody molecules directed against mGluRSM 
c:j can be isolated from the mammal (e.g. , from the blood) and further purified by well 

J*| known techniques, such as protein A chromatography to obtain the IgG fraction. At an 

, □ 1 5 appropriate time after immunization, e.g. , when the anti-mGluR5M antibody titers are 

highest, antibody-producing cells can be obtained from the subject and used to prepare 
T monoclonal antibodies by standard techniques, such as the hybridoma technique 

M originally described by Kohler and Milstein (1 975) Nature 256:495-497) (see also, 

}){ Brown et al (1981) J. Immunol 127:539-46; Brown et al. (1980) J. Biol Chem 

P 20 .255:4980-83; Yeh et al (1976) PNAS 76:2927-31; and Yeh et al (1982) Int. J Cancer 

; J 29:269-75), the more recent human B cell hybridoma technique (Kozbor et al (1 983) 

Immunol Today 4:72), the EBV-hybridoma technique (Cole et al (1985), Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. 
The technology for producing monoclonal antibody hybridomas is well known (see 
25 generally R. H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological 
Analyses, Plenum Publishing Corp., New York, New York (1980); E. A. Lerner (1981) 
Yale J. Biol Med, 54:387-402; M. L. Gefter et al (1977) Somatic Cell Genet. 
3:231-36). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a mGluRSM immunogen as 
30 described above, and the culture supernatants of the resulting hybridoma cells are 
screened to identify a hybridoma producing a monoclonal antibody that binds 
mGluRSM. 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating an anti-mGluR5M 
35 monoclonal antibody (see, e.g., G. Galfre et al (1977) Nature 266:55052; Gefter et al 
Somatic Cell Genet, cited supra', Lerner, Yale J. Biol. Med., cited supra; Kenneth, 
Monoclonal Antibodies, cited supra). Moreover, the ordinarily skilled worker will 
appreciate that there are many variations of such methods which also would be useful. 
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Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the same 
mammalian species as the lymphocytes. For example, murine hybridomas can be made 
by fusing lymphocytes from a mouse immunized with an immunogenic preparation of 
the present invention with an immortalized mouse cell line. Preferred immortal cell 
5 lines are mouse myeloma cell lines that are sensitive to culture medium containing 
hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a number of 
myeloma cell lines can be used as a fusion partner according to standard techniques, 
e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. These 
myeloma lines are available from ATCC. Typically, HAT-sensitive mouse myeloma 
1 0 cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma 

cells resulting from the fusion are then selected using HAT medium, which kills unfused 
y, and unproductively fused myeloma cells (unfused splenocytes die after several days 

g because they are not transformed). Hybridoma cells producing a monoclonal antibody 

f [j of the invention are detected by screening the hybridoma culture supernatants for 

*M 1 5 antibodies that bind mGluR5M, e.g. , using a standard ELISA assay. 

lu Alternative to preparing monoclonal antibody-secreting hybridomas, a 

yj monoclonal anti-mGluR5M antibody can be identified and isolated by screening a 

f , recombinant combinatorial immunoglobulin library (e.g. , an antibody phage display 

hi library) with mGluR5M to thereby isolate immunoglobulin library members that bind 

20 mGluR5M. Kits for generating and screening phage display libraries are commercially 
available (e.g., the Pharmacia Recombinant Phage Antibody System, Catalog No. 27- 
9400-01; and the Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). 
Additionally, examples of methods and reagents particularly amenable for use in 
generating and screening antibody display library can be found in, for example, Ladner 
25 et al. U.S. Patent No. 5,223,409; Kang et al. PCT International Publication No. WO 
92/18619; Dower et al. PCT International Publication No. WO 91/17271; Winter et al. 
PCT International Publication WO 92/20791; Markland et al. PCT International 
Publication No. WO 92/15679; Breitling et al. PCT International Publication WO 
93/01288; McCafferty et al. PCT International Publication No. WO 92/01047; Garrard 
30 et al. PCT International Publication No. WO 92/09690; Ladner et al. PCT International 
Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et 
al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275- 
1281; Griffiths et al. (1993) EMBO J 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 
226:889-896; Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS 
35 89:3576-3580; Garrad et al. (1991) Bio/Technology 9:1373-1377; Hoogenboom et al. 
(1991) Nuc. Acid Res. 19:4133-4137; Barbas et al. (1991) PNAS 88:7978-7982; and 
McCafferty et al. Nature (1990) 348:552-554. 
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Additionally, recombinant anti-mGluR5M antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA techniques, are within the scope of 
the invention. Such chimeric and humanized monoclonal antibodies can be produced by 
5 recombinant DNA techniques known in the art, for example using methods described in 
Robinson et al International Application No. PCT/US86/02269; Akira, et al European 
Patent Application 184,187; Taniguchi, M., European Patent Application 171,496; 
Morrison et al European Patent Application 173,494; Neuberger et al PCT 
International Publication No. WO 86/01533; Cabilly et al U.S. Patent No. 4,816,567; 
10 Cabilly et al European Patent Application 125,023; Better et al (1988) Science 
240:10414043; Liu etal (1987) PNAS 84:3439-3443; Liu et al (1987) J. Immunol 
139:3521-3526; Sun et al (1987) PNAS 84:214-218; Nishimura et al (1987) Cane. Res. 
47:999-1005; Wood et al (1985) Nature 314:446-449; and Shaw et al (1988) J. Natl 
pi Cancer Inst. 80:1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et al 

H 15 (1986) BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al (1986) Nature 

jf{ 321:552-525; Verhoeyaner al (1988) Science 239:1534; and Bei&ler et al (1988),/ 

hi Immunol 141:4053-4060. 

■ An anti-mGluR5M antibody (e.g. , monoclonal antibody) can be used to isolate 

mGluR5M by standard techniques, such as affinity chromatography or 
20 immunoprecipitation. An anti-mGluR5M antibody can facilitate the purification of 
natural mGluRSM from cells and of recombinantly produced mGluR5M expressed in 
host cells. Moreover, an anti-mGluR5M antibody can be used to detect mGluRSM 
protein (e.g. , in a cellular lysate or cell supernatant) in order to evaluate the abundance 
and pattern of expression of the mGluR5M protein. Anti-mGluR5M antibodies can be 
25 used diagnostically to monitor protein levels in tissue as part of a clinical testing 

procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. 
Detection can be facilitated by coupling (i.e., physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
30 materials, and radioactive materials. Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, -galactosidase, or acetylcholinesterase; examples of 
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; 
examples of suitable fluorescent materials include umbelliferone, fluorescein, 
fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl 
35 chloride or phycoerythrin; an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, luciferin, and aequorin, and 
examples of suitable radioactive material include 125 I, 13 1 I, 35 S or 3 H. 
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III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
vectors, containing a nucleic acid encoding a mGluRSM protein (or a portion thereof). 
As used herein, the term "vector" refers to a nucleic acid molecule capable of 
5 transporting another nucleic acid to which it has been linked. One type of vector is a 
"plasmid", which refers to a circular double stranded DNA loop into which additional 
DNA segments can be ligated. Another type of vector is a viral vector, wherein 
additional DNA segments can be ligated into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell into which they are introduced (e.g., 
1 0 bacterial vectors having a bacterial origin of replication and episomal mammalian 

vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
P genome of a host cell upon introduction into the host cell, and thereby are replicated 

pi along with the host genome. Moreover, certain vectors are capable of directing the 

tli expression of genes to which they are operatively linked. Such vectors are referred to 

1 5 herein as "expression vectors". In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids. In the present specification, 
w "plasmid" and "vector" can be used interchangeably as the plasmid is the most 

commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective 
T[ 20 retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 

functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
25 sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
30 host cell when the vector is introduced into the host cell). The term "regulatory 

sequence" is intended to includes promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). Regulatory sequences include those which 
35 direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
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host cell to be transformed, the level of expression of protein desired, etc. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., mGluRSM proteins, mutant forms of mGluRSM proteins, fusion 
5 proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of mGluRSM proteins in prokaryotic or eukaryotic cells. For example, 
mGluRSM proteins can be expressed in bacterial cells such as E. coli, insect cells (using 
baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are 
1 0 discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, CA (1990). Alternatively, the recombinant expression 
vector can be transcribed and translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase, 
f y Expression of proteins in prokaryotes is most often carried out in E. coli with 

fc p{ 15 vectors containing constitutive or inducible promoters directing the expression of either 

FU fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 

hi 

encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
M vectors typically serve three purposes: 1) to increase expression of recombinant protein; 

*** 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 

p 20 of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 

O expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 

moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
25 Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

Purified fusion proteins can be utilized in mGluR5M activity assays, (e.g., direct 
30 assays or competitive assays described in detail below), or to generate antibodies 

specific for mGluRSM proteins, for example. In a preferred embodiment, a mGluR5M 
fusion protein expressed in a retroviral expression vector of the present invention can be 
utilized to infect bone marrow cells which are subsequently transplanted into irradiated 
recipients. The pathology of the subject recipient is then examined after sufficient time 
35 has passed (e.g six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amann et al, (1988) Gene 69:301-315) and pET 1 Id (Studier etal., Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
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California (1 990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
expression from the pET lid vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
5 polymerase is supplied by host strains BL21 (DE3) or HMS 1 74(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E, coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
10 recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 

Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al, (1992) Nucleic Acids Res. 20:21 1 1-2118). 
1 5 Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

In another embodiment, the mGluRSM expression vector is a yeast expression 
£ vector. Examples of vectors for expression in yeast S. cerivisae include pYepSecl 

EI (Baldari, et al, (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 

Hi 20 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:1 13-123), pYES2 (Invitrogen 

Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, CA). 

Alternatively, mGluRSM proteins can be expressed in insect cells using 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins 
in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al (1983) Mol 
25 Cell Biol 3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 
170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 
30 (Kaufman efa/. (1987)EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsh, E. F. ? 
35 and Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd, ed. } Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989. 
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In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
5 tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
1 0 (e.g. , the neurofilament promoter; Byrne and Ruddle (1 989) PNAS 86:5473-5477), 

pancreas-specific promoters (Edlund et al (1985) Science 230:912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and 
European Application Publication No. 264,166). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Grass (1990) 
15 Science 249:374-379) and the oc-fetoprotein promoter (Campes and Tilghman (1989) 
Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
20 a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to mGluRSM mRNA. Regulatory sequences 
operatively linked to a nucleic acid cloned in the antisense orientation can be chosen 
which direct the continuous expression of the antisense RNA molecule in a variety of 
cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be 
25 chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced 
under the control of a high efficiency regulatory region, the activity of which can be 
determined by the cell type into which the vector is introduced. For a discussion of the 
30 regulation of gene expression using antisense genes see Weintraub, H. et al, Antisense 
RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) 
1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
35 "recombinant host cell" are used interchangeably herein. It is understood that such 

terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not, in fact, be 
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identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a mGluRSM 
protein can be expressed in bacterial cells such as E. coli, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid {e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al (Molecular Cloning: A 
Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 
hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding a mGluRSM protein or 
can be introduced on a separate vector. Cells stably transfected with the introduced 
nucleic acid can be identified by drug selection (e.g., cells that have incorporated the 
selectable marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) a mGluRSM protein. Accordingly, the 
invention further provides methods for producing a mGluR5M protein using the host 
cells of the invention. In one embodiment, the method comprises culturing the host cell 
of invention (into which a recombinant expression vector encoding a mGluRSM protein 
has been introduced) in a suitable medium such that a mGluRSM protein is produced. In 
another embodiment, the method further comprises isolating a mGluRSM protein from 
the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which mGluRSM-coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in 
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which exogenous mGluRSM sequences have been introduced into their genome or 
homologous recombinant animals in which endogenous mGluR5M sequences have been 
altered. Such animals are useful for studying the function and/or activity of a mGluRSM 
and for identifying and/or evaluating modulators of mGluRSM activity. As used herein, 
5 a "transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which 

1 0 remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous mGluR5M gene has been 
altered by homologous recombination between the endogenous gene and an exogenous 

1 5 DNA molecule introduced into a cell of the animal, e.g. , an embryonic cell of the 
animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a mGluRSM- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the oocyte to develop in a 

20 pseudopregnant female foster animal. The mGluRSM cDNA sequence, e.g. , that of SEQ 
ID NO:l or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number PTA-2775, can be introduced as a transgene into the 
genome of a non-human animal. Alternatively, a nonhuman orthologue of a human 
mGluRSM gene, such as a mouse or rat mGluRSM gene, can be used as a transgene. 

25 Alternatively, a mGluR5M gene homologue can be isolated based on hybridization to 
the mGluRSM cDNA sequences of SEQ ID NO:l or the nucleotide sequence of the 
DNA insert of the plasmid deposited with ATCC as Accession Number PTA-2775 
(described further in subsection I above) and used as a transgene. Intronic sequences 
and polyadenylation signals can also be included in the transgene to increase the 

30 efficiency of expression of the transgene. A tissue-specific regulatory sequence(s) can 
be operably linked to a mGluRSM transgene to direct expression of a mGluRSM protein 
to particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both 

35 by Leder et al , U. S. Patent No. 4,873, 1 9 1 by Wagner et al and in Hogan, B., 

Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986). Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of a 



-38- 



ATTORNEY DOCKET NO.: GNN-024 



mGluR5M transgene in its genome and/or expression of mGluR5M mRNA in tissues or 
cells of the animals. A transgenic founder animal can then be used to breed additional 
animals carrying the transgene. Moreover, transgenic animals carrying a transgene 
encoding a mGluRSM protein can further be bred to other transgenic animals carrying 
5 other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of a mGluRSM gene into which a deletion, addition or substitution has 
been introduced to thereby alter, e.g., functionally disrupt, the mGluRSM gene. The 
mGluR5M gene can be a human gene (e.g., the cDNA of SEQ ID NO:l), but more 
1 0 preferably, is a non-human homologue of a human mGluR5M gene (e.g. , a cDNA 

isolated by stringent hybridization with the nucleotide sequence of SEQ ID NO:l). For 
pi example, a mouse mGluRSM gene can be used to construct a homologous 

recombination vector suitable for altering an endogenous mGluRSM gene in the mouse 
genome. In a preferred embodiment, the vector is designed such that, upon homologous 
15 recombination, the endogenous mGluR5M gene is functionally disrupted (i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). Alternatively, the 
vector can be designed such that, upon homologous recombination, the endogenous 
mGluR5M gene is mutated or otherwise altered but still encodes functional protein (e.g., 
the upstream regulatory region can be altered to thereby alter the expression of the 
20 endogenous mGluR5M protein). In the homologous recombination vector, the altered 
portion of the mGluR5M gene is flanked at its 5 f and 3 f ends by additional nucleic acid 
sequence of the mGluR5M gene to allow for homologous recombination to occur 
between the exogenous mGluR5M gene carried by the vector and an endogenous 
mGluRSM gene in an embryonic stem cell. The additional flanking mGluRSM nucleic 
25 acid sequence is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' 
ends) are included in the vector (see e.g., Thomas, K.R. and Capecchi, M. R. (1987) Cell 
5 1 :503 for a description of homologous recombination vectors). The vector is 
introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which 
30 the introduced mGluRSM gene has homologously recombined with the endogenous 
mGluR5M gene are selected (see e.g., Li, E. et al. (1992) Cell 69:915). The selected 
cells are then injected into a blastocyst of an animal (e.g., a mouse) to form aggregation 
chimeras (see e.g., Bradley, A. in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987) pp. 113-152). A chimeric 
35 embryo can then be implanted into a suitable pseudopregnant female foster animal and 
the embryo brought to term. Progeny harboring the homologously recombined DNA in 
their germ cells can be used to breed animals in which all cells of the animal contain the 
homologously recombined DNA by germline transmission of the transgene. Methods 
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for constructing homologous recombination vectors and homologous recombinant 
animals are described further in Bradley, A. (1991) Current Opinion in Biotechnology 
2:823-829 and in PCT International Publication Nos.: WO 90/1 1354 by Le Mouellec et 
at; WO 91/01140 by Smithies et at; WO 92/0968 by Zijlstra et at; and WO 93/04169 
5 by Berns et at 

In another embodiment, transgenic non-humans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PL For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) PNAS 

10 89:6232-6236. Another example of a recombinase system is the FLP recombinase 

system of Saccharomyces cerevisiae (O'Gorman et at (1991) Science 251:1351-1355. If 
a cre/loxP recombinase system is used to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase and a selected protein are 
required. Such animals can be provided through the construction of "double" transgenic 

15 animals, e.g., by mating two transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut, L et at (1997) Nature 385:810- 
813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 

20 induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 
of the same species from which the quiescent cell is isolated. The recontructed oocyte is 
then cultured such that it develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne of this female foster animal 

25 will be a clone of the animal from which the cell, e.g. , the somatic cell, is isolated. 

IV. Pharmaceutical Compositions 

The mGluR5M nucleic acid molecules, mGluRSM proteins, anti-mGluR5M 
antibodies, mGluR5M ligands, peptide, peptidomimetics and/or mGluR5M modulators 

30 (also referred to herein as "active compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. Such compositions typically 
comprise the nucleic acid molecule, protein, antibody, or modulatory compound and a 
pharmaceutical^ acceptable carrier. As used herein the language "pharmaceutical^ 
acceptable carrier" is intended to include any and all solvents, dispersion media, 

35 coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 

and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutical^ active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof 
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in the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
5 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

1 0 antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 

1 5 enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 

20 intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 

25 bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 

30 surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can be 

35 brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a mGluR5M protein or anti-mGluR5M antibody) in the required 
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amount in an appropriate solvent with one or a combination of ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are prepared 
by incorporating the active compound into a sterile vehicle which contains a basic 
dispersion medium and the required other ingredients from those enumerated above. In 
the case of sterile powders for the preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutical^ 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
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release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 
5 The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,81 1. 

10 It is especially advantageous to formulate oral or parenteral compositions in 

dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 

1 5 pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by 

20 standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 

25 compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 

30 preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal models to achieve a 

35 circulating plasma concentration range that includes the IC50 (i.e. , the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
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useful doses in humans. Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
5 example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen et al (1994) PNAS 91 :3054-3057). The 
pharmaceutical preparation of the gene therapy vector can include the gene therapy 
vector in an acceptable diluent, or can comprise a slow release matrix in which the gene 
delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector 
1 0 can be produced intact from recombinant cells, e.g. , retroviral vectors, the 

pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

15 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, peptides, antibodies 
and the like described herein can be used in one or more of the following methods: a) 
screening assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, 

20 monitoring clinical trials, and pharmacogenetics); and c) methods of treatment (e.g., 

therapeutic and prophylactic). As described herein, a mGluR5M protein of the invention 
has one or more of the following activities: (1) modulation of G protein linked second 
messenger signaling pathways (e.g., modulation of diacylglycerol and/or inisitol 
triphosphate-mediated signaling pathways); (2) modulation of glutamatergic 

25 transmission; (3) modulation of neuronal excitability; (4) regulation of synaptic 

transmission; (5) modulation of neurotransmitter release (e.g., glutamate release); (6) 
regulation of voltage-dependent and/or voltage-independent and/or ligand-gated ion 
channels (e.g., K+ channels or Ca2+ channels); (7) regulation of neuronal development 
(e.g., regulation of neuronal differentiation, migration and/or survival in the developing 

30 and/or mature brain); (8) modulation of neurodegenerative processes (e.g., acute or 

chronic neurodegenerative processes); and (9) modulation of mGluR5 dimerization (e.g., 
mGluR5a and/or mGluR5b dimerization) and/or modulation of dimerization of other 
mGluR family members (e.g., mGluRl dimerization). Accordingly, the isolated nucleic 
acid molecules of the invention can be used, for example, to express mGluR5M protein 

35 (e.g. , via a recombinant expression vector in a host cell in gene therapy applications), to 
detect mGluR5M mRNA (e.g., in a biological sample) or a genetic alteration in a 
mGluR5M gene, and to modulate mGluR5M activity, as described further below. The 
mGluR5M proteins can be used to treat disorders characterized by insufficient or 
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excessive production of a mGluRSM protein and/or mGluR5M ligand. In addition, the 
mGluR5M proteins can be used to screen drugs or compounds which modulate the 
mGluRSM activity as well as to treat disorders characterized by insufficient or excessive 
production of mGluR5M protein or production of mGluRSM protein forms which have 
decreased or aberrant activity compared to mGluRSM wild type protein. Moreover, the 
anti-mGluRSM antibodies of the invention can be used to detect and isolate mGluR5M 
proteins, regulate the bioavailability of mGluR5M proteins, and modulate mGluRSM 
activity. 

A. Screening Assays : 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to mGluRSM proteins, or 
have a stimulatory or inhibitory effect on, for example, mGluRSM expression or 
mGluR5M activity. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of a mGluRSM protein or 
polypeptide or biologically active portion thereof. The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the 'one-bead one-compound* library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds (Lam, K.S. (1997) 
Anticancer Drug Des. 12:1 45). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl Acad. Set U.S.A. 90:6909; Erb et 

(1994) Proc. Natl. Acad. Set USA 91:11422; Zuckermann et al (1994). J. Med. 
Chem. 37:2678; Cho et al (1993) Science 261:1303; Carrell et al (1994) Angew. Chem. 
Int. Ed. Engl 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl 33:2061; and in 
Galloped/. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution {e.g., Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
'409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
(Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); 
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(Cwirla et al. (1990) Proa Natl Acad. Set 87:6378-6382); (Felici (1991) J. Mol Biol 
222:301-310); (Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a mGluR protein on the cell surface is contacted with a mGluRSM protein and 
5 a test compound and the ability of the test compound to modulate the binding of the 
mGluRSM protein to the mGluR determined. The cell, for example, can be of 
mammalian origin or a yeast cell. Determining the ability of the test compound to 
modulate the binding of the mGluR5M protein to the mGluR can be accomplished, for 
example, by coupling the test compound or the mGluRSM protein with a radioisotope or 
10 enzymatic label such that binding of the test compound or the mGluR5M protein to the 
mGluR protein can be determined by detecting the labeled compound in a complex. For 
example, test compounds can be labeled with 125 I, 35 S, 14 c, or 3 H, either directly or 
indirectly, and the radioisotope detected by direct counting of radioemmission or by 
scintillation counting. Alternatively, compounds or proteins can be enzymatically 
1 5 labeled with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, 
and the enzymatic label detected by determination of conversion of an appropriate 
substrate to product. 

It is also within the scope of this invention to determine the ability of a test 
compound to modulate interaction of a mGluRSM protein without the labeling of any of 
20 the interactants. For example, a microphysiometer can be used to detect the interaction 
of a test compound with a mGluR5M protein without the labeling of either the test 
compound or the mGluRSM protein. McConnell, H. M. et al (1992) Science 
257:1906-1912. As used herein, a "microphysiometer" (e.g., Cytosensor™) is an 
analytical instrument that measures the rate at which a cell acidifies its environment 
25 using a light-addressable potentiometric sensor (LAPS). Changes in this acidification 
rate can be used as an indicator of the interaction between ligand and receptor. 

In a preferred embodiment, the assay comprises contacting a cell which 
expresses a mGluR5 protein on the cell surface with a mGluR5 ligand, to form an assay 
mixture, contacting the assay mixture with a mGluR5M protein or variant, optionally in 
30 addition to a test compound, and determining the ability of the test compound or the 
mGluR5M protein or variant to interact with the mGluRS protein. In one embodiment, 
determining the ability of the mGluRSM protein or variant or test compound to interact 
with the mGluR5 protein comprises determining the ability of the mGluR5M protein or 
variant or test compound to preferentially bind to the mGluRS protein, as compared to 
35 the ability of the mGluRSM to bind to the mGluR5 protein. 

Determining the ability of the mGluRSM protein ortest compound to bind to or 
interact with a mGluR5 protein can be accomplished by one of the methods described 
above for determining direct binding. In a preferred embodiment, determining the 
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ability of the mGluR5M protein or test compound to bind to or interact with a mGluR5 
protein can be accomplished by determining the activity of the mGluR5 protein or of a 
downstream mGluR5 target molecule. For example, the target molecule can be a 
cellular second messenger, and the activity of the target molecule can be determined by 

5 detecting induction of of the target {i.e. intracellular Ca 2+ , diacylglycerol, IP3, etc.), 
detecting catalytic/enzymatic activity of the target on an appropriate substrate, detecting 
the induction of a reporter gene (comprising a mGluRS -responsive regulatory element 
operatively linked to a nucleic acid encoding a detectable marker, e.g., luciferase), or 
detecting a cellular response, for example, a proliferative response or a neuronal 

10 response. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which a mGluR5M protein or biologically active portion thereof is contacted with a 
test compound and the ability of the test compound to bind to the mGluR5M protein or 
biologically active portion thereof is determined. Binding of the test compound to the 
1 5 mGluRSM protein can be determined either directly or indirectly as described above. 
Binding of the test compound to the mGluRSM protein can also be accomplished using 
a technology such as real-time Biomolecular Interaction Analysis (BIA). Sjolander, S. 
and Urbaniczky, C. (1991) Anal. Chem. 63:2338-2345 and Szabo et al. (1995) Curr. 
Opin. Struct Biol. 5:699-705. As used herein, "BIA" is a technology for studying 
20 biospecific interactions in real time, without labeling any of the interactants (e.g. , 

BIAcore™). Changes in the optical phenomenon of surface plasmon resonance (SPR) 
can be used as an indication of real-time reactions between biological molecules. 

In a preferred embodiment, the assay includes contacting the mGluR5M protein 
or biologically active portion thereof with a known ligand which binds mGluR5M to 
25 form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with a mGluRSM protein, 
wherein determining the ability of the test compound to interact with a mGluR5M 
protein comprises determining the ability of the test compound to preferentially bind to 
mGluRSM or biologically active portion thereof as compared to the known ligand. 
30 In another embodiment, the assay is a cell-free assay in which a mGluR5M 

protein or biologically active portion thereof is contacted with a test compound and the 
ability of the test compound to modulate (e.g., stimulate or inhibit) the activity of the 
mGluRSM protein or biologically active portion thereof is determined. Determining the 
ability of the test compound to modulate the activity of a mGluRSM protein can be 
35 accomplished, for example, by determining the ability of the mGluR5M protein to 

modulate the activity of a downstream mGluRSM target molecule by one of the methods 
described above for cell-based assays. For example, the catalytic/enzymatic activity of 
the target molecule on an appropriate substrate can be determined as previously 
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described. Alternatively, the ability of mGluR5M to bind to or interact with a mGluR 
can be determined as previously described. 

In yet another embodiment, the cell-free assay involves contacting a mGluR5M 
protein or biologically active portion thereof with a known ligand which binds the 
5 mGluR5M protein and optionally a mGluR protein, to form an assay mixture, contacting 
the assay mixture with a test compound, and determining the ability of the test 
compound to interact with the mGluRSM protein, wherein determining the ability of the 
test compound to interact with the mGluR5M protein comprises determining the ability 
of the test compound to preferentially bind to or modulate the activity of a mGluRSM 

10 target molecule, as compared to the known ligand. 

The cell-free assays of the present invention are amenable to use of both soluble 
and/or membrane-bound forms of isolated proteins (e.g. mGluR5M proteins or mGluRS 
proteins). In the case of cell-free assays in which a membrane-bound form an isolated 
protein is used (e.g., a mGluRS protein) it may be desirable to utilize a solubilizing agent 

1 5 such that the membrane-bound form of the isolated protein is maintained in solution. 
Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 
Isotridecypoly(ethylene glycol ether) n , 3-[(3-cholamidopropyl)dimethylamminio]-l- 

20 propane sulfonate (CHAPS), 3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-l- 
propane sulfonate (CHAPSO), or N-dodecyl=N,N-dimethyl-3-ammonio-l -propane 
sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either mGluR5M or its target molecule to 

25 facilitate separation of complexed from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to a mGluRSM protein, or interaction of a mGluRSM protein with a target 
molecule in the presence and absence of a candidate compound, can be accomplished in 
any vessel suitable for containing the reactants. Examples of such vessels include 

30 microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 
protein can be provided which adds a domain that allows one or both of the proteins to 
be bound to a matrix. For example, glutathione-S-transferase/ mGluRSM fusion 
proteins or glutathione-S-transferase/target fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized 

35 microtitre plates, which are then combined with the test compound or the test compound 
and either the non-adsorbed target protein or mGluR5M protein, and the mixture 
incubated under conditions conducive to complex formation (e.g., at physiological 
conditions for salt and pH). Following incubation, the beads or microtitre plate wells are 
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washed to remove any unbound components, the matrix immobilized in the case of 
beads, complex determined either directly or indirectly, for example, as described above. 
Alternatively, the complexes can be dissociated from the matrix, and the level of 
mGluRSM binding or activity determined using standard techniques. 
5 Other techniques for immobilizing proteins on matrices can also be used in the 

screening assays of the invention. For example, either a mGluRSM protein or a 
mGluR5M target molecule can be immobilized utilizing conjugation of biotin and 
streptavidin. Biotinylated mGluRSM protein or target molecules can be prepared from 
biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g., 

10 biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the wells of 

streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive 
with mGluR5M protein or target molecules but which do not interfere with binding of 
the mGluRSM protein to its target molecule can be derivatized to the wells of the plate, 
and unbound target or mGluRSM protein trapped in the wells by antibody conjugation. 

1 5 Methods for detecting such complexes, in addition to those described above for the 

GST-immobilized complexes, include immunodetection of complexes using antibodies 
reactive with the mGluRSM protein or target molecule, as well as enzyme-linked assays 
which rely on detecting an enzymatic activity associated with the mGluRSM protein or 
target molecule. 

20 In another embodiment, modulators of mGluRSM expression are identified in a 

method wherein a cell is contacted with a candidate compound and the expression of 
mGluR5M mRNA or protein in the cell is determined. The level of expression of 
mGluRSM mRNA or protein in the presence of the candidate compound is compared to 
the level of expression of mGluRSM mRNA or protein in the absence of the candidate 

25 compound. The candidate compound can then be identified as a modulator of 

mGluRSM expression based on this comparison. For example, when expression of 
mGluRSM mRNA or protein is greater (statistically significantly greater) in the presence 
of the candidate compound than in its absence, the candidate compound is identified as a 
stimulator of mGluRSM mRNA or protein expression. Alternatively, when expression 

30 of mGluRSM mRNA or protein is less (statistically significantly less) in the presence of 
the candidate compound than in its absence, the candidate compound is identified as an 
inhibitor of mGluRSM mRNA or protein expression. The level of mGluRSM mRNA or 
protein expression in the cells can be determined by methods described herein for 
detecting mGluRSM mRNA or protein. 

35 In yet another aspect of the invention, the mGluRSM proteins can be used as 

"bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 
5,283,3 1 7; Zervos et al (1 993) Cell 72:223-232; Madura et al (1 993) J. Biol Chem. 
268:12046-12054; Bartel et al (1993) Biotechniques 14:920-924; Iwabuchi et al 
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(1993) Oncogene 8:1693-1696; and Brent WO94/10300), to identify other proteins, 
which bind to or interact with mGluRSM ("mGluRSM-binding proteins" or "mGluR5M- 
bp") and are involved in mGluR5M activity. Such mGluRSM-binding proteins are also 
likely to be involved in the propagation of signals by the mGluRSM proteins as, for 
5 example, downstream elements of a mGluRSM-mediated signaling pathway. 

Alternatively, such mGluR5M-binding proteins are likely to be cell-surface molecules 
associated with non-mGluR5M expressing cells, wherein such mGluRSM-binding 
proteins are involved in chemoattraction. 

The two-hybrid system is based on the modular nature of most transcription 
1 0 factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for a 
mGluRSM protein is fused to a gene encoding the DNA binding domain of a known 
transcription factor {e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
15 fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a mGluRSM- 
dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
{e.g. , LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
20 transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be isolated and used to obtain the 
cloned gene which encodes the protein which interacts with the mGluRSM protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
25 agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein {e.g., a mGluRSM modulating agent, an antisense 
mGluRSM nucleic acid molecule, a mGluR5M-specific antibody, or a mGluRSM- 
binding partner) can be used in an animal model to determine the efficacy, toxicity, or 
side effects of treatment with such an agent. Alternatively, an agent identified as 
30 described herein can be used in an animal model to determine the mechanism of action 
of such an agent. Furthermore, this invention pertains to uses of novel agents identified 
by the above-described screening assays for treatments as described herein. 

B. Detection Assays 
35 Portions or fragments of the cDNA sequences identified herein (and the 

corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome; and, thus, locate gene regions associated with 
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genetic disease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 



5 1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the mGluR5M 
nucleotide sequences, described herein, can be used to map the location of the 
10 mGluRSM genes on a chromosome. The mapping of the mGluR5M sequences to 
chromosomes is an important first step in correlating these sequences with genes 
associated with disease. 

Briefly, mGluRSM genes can be mapped to chromosomes by preparing PCR 
primers (preferably 15-25 bp in length) from the mGluR5M nucleotide sequences. 
1 5 Computer analysis of the mGluRSM sequences can be used to predict primers that do 
not span more than one exon in the genomic DNA, thus complicating the amplification 
process See e.g., Example 3, herein. These primers can then be used for PCR screening 
of somatic cell hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the mGluRSM sequences will yield an 
20 amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different 
C mammals (e.g., human and mouse cells). As hybrids of human and mouse cells grow 

and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
25 lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 
human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
30 chromosomes. (D'Eustachio P. et al (1983) Science 220:919-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular sequence to a particular chromosome. Three or more sequences can be 
35 assigned per day using a single thermal cycler. Using the mGluR5M nucleotide 
sequences to design oligonucleotide primers, sublocalization can be achieved with 
panels of fragments from specific chromosomes. Other mapping strategies which can 
similarly be used to map a 9o, lp, or lv sequence to its chromosome include in situ 
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hybridization (described in Fan, Y. et al (1990) PNAS, 87:6223-27), pre-screening with 
labeled flow-sorted chromosomes, and pre-selection by hybridization to chromosome 
specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
5 chromosomal spread can further be used to provide a precise chromosomal location in 
one step. Chromosome spreads can be made using cells whose division has been 
blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 
10 can be identified individually. The FISH technique can be used with a DNA sequence 
as short as 500 or 600 bases. However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
JI;J for simple detection. Preferably 1,000 bases, and more preferably 2,000 bases will 

Q suffice to get good results at a reasonable amount of time. For a review of this 

[H 1 5 technique, see Verma et al , Human Chromosomes: A Manual of Basic Techniques 
y;| (Pergamon Press, New York 1988). 

HI Reagents for chromosome mapping can be used individually to mark a single 

s • § 

chromosome or a single site on that chromosome, or panels of reagents can be used for 
M marking multiple sites and/or multiple chromosomes. Reagents corresponding to 

n\ 20 noncoding regions of the genes actually are preferred for mapping purposes. Coding 
U sequences are more likely to be conserved within gene families, thus increasing the 

J* 1 chance of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
25 data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 

Man, available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al (1987) Nature, 325:783-787. 
30 Moreover, differences in the DNA sequences between individuals affected and 

unaffected with a disease associated with the mGluR5M gene, can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 
35 structural alterations in the chromosomes, such as deletions or translocations that are 

visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 



-52- 



ATTORNEY DOCKET NO.: GNN-024 



2. Tissue Typing 

The mGluR5M sequences of the present invention can also be used to identify 
individuals from minute biological samples. The United States military, for example, is 
5 considering the use of restriction fragment length polymorphism (RFLP) for 

identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
unique bands for identification. This method does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 

10 difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 
alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the mGluR5M nucleotide sequences 

15 described herein can be used to prepare two PCR primers from the 5 1 and 3 r ends of the 
sequences. These primers can then be used to amplify an individual's DNA and 
subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 

20 unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
and from tissue. The mGluR5M nucleotide sequences of the invention uniquely 
represent portions of the human genome. Allelic variation occurs to some degree in the 
coding regions of these sequences, and to a greater degree in the noncoding regions. It 

25 is estimated that allelic variation between individual humans occurs with a frequency of 
about once per each 500 bases. Each of the sequences described herein can, to some 
degree, be used as a standard against which DNA from an individual can be compared 
for identification purposes. Because greater numbers of polymorphisms occur in the 
noncoding regions, fewer sequences are necessary to differentiate individuals. The 

30 noncoding sequences of SEQ ID NO: 1 , can comfortably provide positive individual 

identification with a panel of perhaps 10 to 1,000 primers which each yield a noncoding 
amplified sequence of 100 bases. If predicted coding sequences, such as those in SEQ 
ID NO:3 are used, a more appropriate number of primers for positive individual 
identification would be 500-2,000. 

35 If a panel of reagents from mGluR5M nucleotide sequences described herein is 

used to generate a unique identification database for an individual, those same reagents 
can later be used to identify tissue from that individual. Using the unique identification 
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database, positive identification of the individual, living or dead, can be made from 
extremely small tissue samples. 

3. Use of Partial mGluRSM Sequences in Forensic Biology 
5 DNA-based identification techniques can also be used in forensic biology. 

Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 

10 skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 

1 5 enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Sequences targeted to noncoding regions of SEQ ID NO: 1 are 

20 particularly appropriate for this use as greater numbers of polymorphisms occur in the 
noncoding regions, making it easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the mGluRSM nucleotide sequences or 
portions thereof, e.g. , fragments derived from the noncoding regions of SEQ ID NO: 1 , 
having a length of at least 20 bases, preferably at least 30 bases. 

25 The mGluR5M nucleotide sequences described herein can further be used to 

provide polynucleotide reagents, e.g., labeled or labelable probes which can be used in, 
for example, an in situ hybridization technique, to identify a specific tissue, e.g., brain 
tissue. This can be very useful in cases where a forensic pathologist is presented with a 
tissue of unknown origin. Panels of such mGluR5M probes can be used to identify 

30 tissue by species and/or by organ type. 

In a similar fashion, these reagents, e.g., mGluRSM primers or probes can be 
used to screen tissue culture for contamination (i.e. screen for the presence of a mixture 
of different types of cells in a culture). 

35 C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trails are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
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Accordingly, one aspect of the present invention relates to diagnostic assays for 
determining mGluRSM protein and/or nucleic acid expression as well as mGluR5M 
activity, in the context of a biological sample (e.g., blood, serum, cells, tissue) to thereby 
determine whether an individual is afflicted with a disease or disorder, or is at risk of 
5 developing a disorder, associated with aberrant mGluRSM expression or activity. The 
invention also provides for prognostic (or predictive) assays for determining whether an 
individual is at risk of developing a disorder associated with mGluR5M protein, nucleic 
acid expression or activity. For example, mutations in a mGluRSM gene can be 
assayed in a biological sample. Such assays can be used for prognostic or predictive 
10 purpose to thereby phophylactically treat an individual prior to the onset of a disorder 
characterized by or associated with mGluR5M protein, nucleic acid expression or 
activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of mGluRSM in clinical trials. 
15 These and other agents are described in further detail in the following sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or absence of mGluR5M 
protein or nucleic acid in a biological sample involves obtaining a biological sample 

20 from a test subject and contacting the biological sample with a compound or an agent 
capable of detecting mGluRSM protein or nucleic acid (e.g., mRNA, genomic DNA) 
that encodes mGluRSM protein such that the presence of mGluR5M protein or nucleic 
acid is detected in the biological sample. A preferred agent for detecting mGluR5M 
mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to 

25 mGluR5M mRNA or genomic DNA. The nucleic acid probe can be, for example, a full- 
length mGluRSM nucleic acid, such as the nucleic acid of SEQ ID NO: 1, or a fragment 
or portion of a mGluRSM nucleic acid such as an oligonucleotide of at least 15, 30, 50, 
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to mGluRSM mRNA or genomic DNA. Other suitable probes for 

30 use in the diagnostic assays of the invention are described herein. 

A preferred agent for detecting mGluRSM protein is an antibody capable of 
binding to mGluR5M protein, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a 
fragment thereof (e.g., Fab or F(ab')2) can be used. The term "labeled", with regard to 

35 the probe or antibody, is intended to encompass direct labeling of the probe or antibody 
by coupling (i.e., physically linking) a detectable substance to the probe or antibody, as 
well as indirect labeling of the probe or antibody by reactivity with another reagent that 
is directly labeled. Examples of indirect labeling include detection of a primary 
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antibody using a fluorescently labeled secondary antibody and end-labeling of a DNA 
probe with biotin such that it can be detected with fluorescently labeled streptavidin. 
The term "biological sample" is intended to include tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject. That 
5 is, the detection method of the invention can be used to detect mGluRSM mRNA, 

protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, 
in vitro techniques for detection of mGluRSM mRNA include Northern hybridizations 
and in situ hybridizations. In vitro techniques for detection of mGluRSM protein 
include enzyme linked immunosorbent assays (ELISAs), Western blots, 

10 immunoprecipitations and immunofluorescence. In vitro techniques for detection of 
mGluRSM genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of mGluR5M protein include introducing into a subject a 
labeled anti-mGluR5M antibody. For example, the antibody can be labeled with a 
radioactive marker whose presence and location in a subject can be detected by standard 

1 5 imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
test subject. Alternatively, the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test subject. A preferred biological 
sample is a serum sample isolated by conventional means from a subject. 

20 In another embodiment, the methods further involve obtaining a control 

biological sample from a control subject, contacting the control sample with a 
compound or agent capable of detecting mGluR5M protein, mRNA, or genomic DNA, 
such that the presence of mGluR5M protein, mRNA or genomic DNA is detected in the 
biological sample, and comparing the presence of mGluRSM protein, mRNA or 

25 genomic DNA in the control sample with the presence of mGluR5M protein, mRNA or 
genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of mGluRSM in a 
biological sample. For example, the kit can comprise a labeled compound or agent 
capable of detecting mGluR5M protein or mRNA in a biological sample; means for 

30 determining the amount of mGluRSM in the sample; and means for comparing the 
amount of mGluRSM in the sample with a standard. The compound or agent can be 
packaged in a suitable container. The kit can further comprise instructions for using the 
kit to detect mGluRSM protein or nucleic acid. 



35 2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
mGluRSM or mGluR expression or activity. For example, the assays described herein, 
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such as the preceding diagnostic assays or the following assays, can be utilized to 
identify a subject having or at risk of developing a disorder associated with mGluR5M 
protein, nucleic acid expression or activity such as a CNS or psychiatric disorder. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk 
5 for developing a CNS or psychiatric disorder. Thus, the present invention provides a 
method for identifying a disease or disorder associated with aberrant mGluRSM or 
mGluR expression or activity in which a test sample is obtained from a subject and 
mGluRSM protein or nucleic acid (e.g, mRNA, genomic DNA) is detected, wherein the 
presence of mGluRSM protein or nucleic acid is diagnostic for a subject having or at risk 
1 0 of developing a disease or disorder associated with aberrant mGluRSM or mGluR 
expression or activity. As used herein, a "test sample" refers to a biological sample 
obtained from a subject of interest. For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue, in particular a neuronal cell sample or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
1 5 whether a subject can be administered an agent (e.g. , an agonist, antagonist, 

peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant mGluR5M expression or activity. 
For example, such methods can be used to determine whether a subject can be 
effectively treated with an agent for a disorder, such as a CNS or psychiatric disorder. 
20 Alternatively, such methods can be used to determine whether a subject can be 
effectively treated with an agent for an inflammatory disease. Thus, the present 
invention provides methods for determining whether a subject can be effectively treated 
with an agent for a disorder associated with aberrant mGluRSM or mGluR expression or 
activity in which a test sample is obtained and mGluR5M protein or nucleic acid 
25 expression or activity is detected (e.g., wherein the abundance of mGluR5M protein or 
nucleic acid expression or activity is diagnostic for a subject that can be administered the 
agent to treat a disorder associated with aberrant mGluR5M or mGluR expression or 
activity.) 

The methods of the invention can also be used to detect genetic alterations in a 
30 mGluR5M gene, thereby determining if a subject with the altered gene is at risk for a 

disorder characterized by an aberrant inflammatory response. In preferred embodiments, 
the methods include detecting, in a sample of cells from the subject, the presence or 
absence of a genetic alteration characterized by at least one of an alteration affecting the 
integrity of a gene encoding a mGluRSM-protein, or the mis-expression of the 
35 mGluR5M gene. For example, such genetic alterations can be detected by ascertaining 
the existence of at least one of 1) a deletion of one or more nucleotides from a 
mGluRSM gene; 2) an addition of one or more nucleotides to a mGluRSM gene; 3) a 
substitution of one or more nucleotides of a mGluRSM gene, 4) a chromosomal 
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rearrangement of a mGluR5M gene; 5) an alteration in the level of a messenger RNA 
transcript of a mGluRSM gene, 6) aberrant modification of a mGluR5M gene, such as of 
the methylation pattern of the genomic DNA, 7) the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of a mGluRSM gene, 8) a non-wild type level of 
5 a mGluR5M-protein, 9) allelic loss of a mGluRSM gene, and 1 0) inappropriate post- 
radiational modification of a mGluR5M-protein. As described herein, there are a large 
number of assay techniques known in the art which can be used for detecting alterations 
in a mGluRSM gene. A preferred biological sample is a tissue or serum sample isolated 
by conventional means from a subject. 
10 In certain embodiments, detection of the alteration involves the use of a 

probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a 
jj*5 ligation chain reaction (LCR) (see, e.g., Landegran et al (1988) Science 241 :1077-1080; 

O and Nakazawa et al (1994) PNAS 91:360-364), the latter of which can be particularly 

T 5 1 5 useful for detecting point mutations in the mGluR5M-gene (see Abravaya et al (1 995) 
yi Nucleic Acids Res .23:675-682). This method can include the steps of collecting a 

f \ a 

\H sample of cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) 

111 

5 from the cells of the sample, contacting the nucleic acid sample with one or more 

primers which specifically hybridize to a mGluRSM gene under conditions such that 
m 20 hybridization and amplification of the mGluR5M-gene (if present) occurs, and detecting 
H the presence or absence of an amplification product, or detecting the size of the 

? ;: f amplification product and comparing the length to a control sample. It is anticipated 

that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 
25 Alternative amplification methods include: self sustained sequence replication 

(Guatelli, J.C. eta!., 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al, 1989, Proc. Natl Acad. Sci. USA 86:1 173- 
1 177), Q-Beta Replicase (Lizardi, P.M. et all, 1988, Bio/Technology 6:1 197), or any 
other nucleic acid amplification method, followed by the detection of the amplified 
30 molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 

In an alternative embodiment, mutations in a mGluRSM gene from a sample cell 
can be identified by alterations in restriction enzyme cleavage patterns. For example, 
35 sample and control DNA is isolated, amplified (optionally), digested with one or more 
restriction endonucleases, and fragment length sizes are determined by gel 
electrophoresis and compared. Differences in fragment length sizes between sample and 
control DNA indicates mutations in the sample DNA. Moreover, the use of sequence 
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specific ribozymes (see, for example, U.S. Patent No. 5,498,531) can be used to score 
for the presence of specific mutations by development or loss of a ribozyme cleavage 
site. 

In other embodiments, genetic mutations in mGluRSM can be identified by 
5 hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high density 
arrays containing hundreds or thousands of oligonucleotides probes (Cronin, M.T. et al 
(1996) Human Mutation 7: 244-255; Kozal, MJ. et al (1996) Nature Medicine 2: 753- 
759). For example, genetic mutations in mGluR5M can be identified in two 
dimensional arrays containing light-generated DNA probes as described in Cronin, M.T. 

10 et al. supra. Briefly, a first hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify base changes between the 
sequences by making linear arrays of sequential ovelapping probes. This step allows the 
identification of point mutations. This step is followed by a second hybridization array 
that allows the characterization of specific mutations by using smaller, specialized probe 

15 arrays complementary to all variants or mutations detected. Each mutation array is 

composed of parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the mGluRSM gene and detect mutations by 

20 comparing the sequence of the sample mGluRSM with the corresponding wild-type 
(control) sequence. Examples of sequencing reactions include those based on 
techniques developed by Maxim and Gilbert ((1977) PNAS 74:560) or Sanger ((1977) 
PNAS 74:5463). It is also contemplated that any of a variety of automated sequencing 
procedures can be utilized when performing the diagnostic assays ((1995) Biotechniques 

25 19:448), including sequencing by mass spectrometry (see, e.g., PCT International 

Publication No. WO 94/16101; Cohen et al {\996)Adv. Chromatogr. 36:127-162; and 
Griffin al (1993) Appl Biochem. Biotechnol 38:147-159). 

Other methods for detecting mutations in the mGluR5M gene include methods in 
which protection from cleavage agents is used to detect mismatched bases in RNA/RNA 

30 or RNA/DNA heteroduplexes (Myers et al (1985) Science 230:1242). In general, the 
art technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 
hybridizing (labeled) RNA or DNA containing the wild-type mGluR5M sequence with 
potentially mutant RNA or DNA obtained from a tissue sample. The double-stranded 
duplexes are treated with an agent which cleaves single-stranded regions of the duplex 

35 such as which will exist due to basepair mismatches between the control and sample 
strands. For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA 
hybrids treated with SI nuclease to enzymatically digesting the mismatched regions. In 
other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
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hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then 
separated by size on denaturing polyacrylamide gels to determine the site of mutation. 
See, for example, Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al 
5 (1992) Methods Enzymol 217:286-295. In a preferred embodiment, the control DNA or 
RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for detecting and mapping point 
1 0 mutations in mGluR5M cDNAs obtained from samples of cells. For example, the mutY 
enzyme of E. coli cleaves A at G/A mismatches and the thymidine DNA glycosylase 
from HeLa cells cleaves T at G/T mismatches (Hsu et al (1994) Carcinogenesis 
15:1657-1662). According to an exemplary embodiment, a probe based on a 
mGluR5M sequence, e.g., a wild-type mGluR5M sequence, is hybridized to a cDNA or 
1 5 other DNA product from a test cell(s). The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can be detected from electrophoresis 
protocols or the like. See, for example, U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in mGluR5M genes. For example, single strand conformation 
20 polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids (Orita et al (1989) Proc Natl Acad. Sci 
USA: 86:2766, see also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) 
Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and control 
mGluRSM nucleic acids will be denatured and allowed to renature. The secondary 
25 structure of single-stranded nucleic acids varies according to sequence, the resulting 

alteration in electrophoretic mobility enables the detection of even a single base change. 
The DNA fragments may be labeled or detected with labeled probes. The sensitivity of 
the assay may be enhanced by using RNA (rather than DNA), in which the secondary 
structure is more sensitive to a change in sequence. In a preferred embodiment, the 
30 subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic mobility (Keen et al (1991) Trends 
Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
35 gradient gel electrophoresis (DGGE) (Myers et al (1985) Nature 3 1 3 :495). When 

DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of approximately 40 bp of 
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 



-60- 



ATTORNEY DOCKET NO.: GNN-024 



used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
5 primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al (1986) Nature 
324:163); Saiki et al (1989) Proc. Natl Acad. Sci USA 86:6230). Such allele specific 
oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
1 0 mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific amplification may carry the mutation of 
1 5 interest in the center of the molecule (so that amplification depends on differential 

hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3 f 
end of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition it may be desirable 
to introduce a novel restriction site in the region of the mutation to create cleavage-based 
20 detection (Gasparini et al (1992) Mol Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl Acad. Sci USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3' end of the 5' sequence making it possible to detect 
the presence of a known mutation at a specific site by looking for the presence or absence 
25 of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving a 
30 mGluR5M gene. 

Furthermore, any cell type or tissue in which mGluRSM is expressed may be 
utilized in the prognostic assays described herein. 

3. Monitoring of Effects During Clinical Trials 
35 Monitoring the influence of agents {e.g. , drugs, compounds) on the expression or 

activity of a mGluR5M protein {e.g., modulation of a CNS or psychiatric responese) an 
be applied not only in basic drug screening, but also in clinical trials. For example, the 
effectiveness of an agent determined by a screening assay as described herein to increase 
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mGluRSM gene expression, protein levels, or upregulate mGluR5M or mGluR activity, 
can be monitored in clinical trails of subjects exhibiting decreased mGluR5M gene 
expression, protein levels, or downregulated mGluR5M or mGluR activity. 
Alternatively, the effectiveness of an agent determined by a screening assay to decrease 
5 mGluRSM gene expression, protein levels, or downregulate mGluRSM or mGluR 
activity, can be monitored in clinical trails of subjects exhibiting increased mGluRSM 
gene expression, protein levels, or upregulated mGluR5M or mGluR activity. In such 
clinical trials, the expression or activity of a mGluRSM gene, and preferably, other 
genes that have been implicated in, for example, an inflammatory disorder can be used 

10 as a "read out" or markers of the phenotype of a particular cell. 

For example, and not by way of limitation, genes, including mGluR5M, that are 
modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) 
which modulates mGluRSM activity (e.g., identified in a screening assay as described 
herein) can be identified. Thus, to study the effect of agents on CNS or psychiatric 

1 5 disorders, for example, in a clinical trial, cells can be isolated and RNA prepared and 
analyzed for the levels of expression of mGluR5M and other genes implicated in the 
CNS or psychiatric disorder, respectively. The levels of gene expression (i.e., a gene 
expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced, by one of the 

20 methods as described herein, or by measuring the levels of activity of mGluRSM or 

other genes. In this way, the gene expression pattern can serve as a marker, indicative of 
the physiological response of the cells to the agent. Accordingly, this response state 
may be determined before, and at various points during treatment of the individual with 
the agent. 

25 In a preferred embodiment, the present invention provides a method for 

monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described herein) comprising the steps of (i) 
obtaining a pre-administration sample from a subject prior to administration of the 

30 agent; (ii) detecting the level of expression of a mGluRSM protein, mRNA, or genomic 
DNA in the preadministration sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of expression or activity of the 
mGluRSM or mGluR protein, mRNA, or genomic DNA in the post-administration 
samples; (v) comparing the level of expression or activity of the mGluR5M or mGluR 

35 protein, mRNA, or genomic DNA in the pre-administration sample with the mGluR5M 
protein, mRNA, or genomic DNA in the post administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 
increased administration of the agent may be desirable to increase the expression or 
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activity of mGluRSM to higher levels than detected, i.e., to increase the effectiveness of 
the agent. Alternatively, decreased administration of the agent may be desirable to 
decrease expression or activity of mGluRSM to lower levels than detected, Le. to 
decrease the effectiveness of the agent. According to such an embodiment, mGluRSM 
expression or activity may be used as an indicator of the effectiveness of an agent, even 
in the absence of an observable phenotypic response. 

D. Methods of Treatment : 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant mGluRSM or mGluR expression or activity. With regards to both 
prophylactic and therapeutic methods of treatment, such treatments may be specifically 
tailored or modified, based on knowledge obtained from the field of pharmacogenomics. 
"Pharmacogenomics", as used herein, refers to the application of genomics technologies 
such as gene sequencing, statistical genetics, and gene expression analysis to drugs in 
clinical development and on the market. More specifically, the term refers the study of 
how a patient's genes determine his or her response to a drug (e.g., a patient's "drug 
response phenotype", or "drug response genotype".) Thus, another aspect of the 
invention provides methods for tailoring an individual's prophylactic or therapeutic 
treatment with either the mGluRSM molecules of the present invention or mGluR5M 
modulators according to that individual's drug response genotype. Pharmacogenomics 
allows a clinician or physician to target prophylactic or therapeutic treatments to patients 
who will most benefit from the treatment and to avoid treatment of patients who will 
experience toxic drug-related side effects. 

1 . Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with an aberrant mGluRSM or mGluR expression or 
activity, by administering to the subject a mGluRSM or an agent which modulates 
mGluRSM expression or at least one mGluRSM or mGluR activity. Subjects at risk for 
a disease which is caused or contributed to by aberrant mGluR5M expression or activity 
can be identified by, for example, any or a combination of diagnostic or prognostic 
assays as described herein. Administration of a prophylactic agent can occur prior to the 
manifestation of symptoms characteristic of the mGluRSM aberrancy, such that a 
disease or disorder is prevented or, alternatively, delayed in its progression. Depending 
on the type of mGluRSM aberrancy, for example, a mGluRSM, mGluRSM agonist or 
mGluR5M antagonist agent can be used for treating the subject. The appropriate agent 
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can be determined based on screening assays described herein. The prophylactic 
methods of the present invention are further discussed in the following subsections. 

2. Therapeutic Methods 
5 Another aspect of the invention pertains to methods of modulating mGluRSM 

expression or activity for therapeutic purposes. Accordingly, in an exemplary 
embodiment, the modulatory method of the invention involves contacting a cell with a 
mGluRSM molecule of the present invention such that the activity of a mGluR5M is 
modulated. Alternatively, the modulatory method of the invention involves contacting a 
10 cell with an agent that modulates one or more of the activities of mGluRSM protein 

activity associated with the cell. An agent that modulates mGluR5M protein activity can 
be an agent as described herein, such as a nucleic acid or a protein, a naturally-occurring 

H target molecule of a mGluRSM protein, a mGluRSM antibody, a mGluRSM agonist or 

t n 

gi antagonist, a peptidomimetic of a mGluR5M agonist or antagonist, or other small 

III 15 molecule. In one embodiment, the agent stimulates one or more mGluR5M activites. 
i f l Examples of such stimulatory agents include active mGluRSM protein and a nucleic 

RJ acid molecule encoding mGluR5M that has been introduced into the cell. In another 

embodiment, the agent inhibits one or more mGluRSM activites. Examples of such 
inhibitory agents include antisense mGluR5M nucleic acid molecules and anti- 
20 mGluR5M antibodies. These modulatory methods can be performed in vitro {e.g., by 
culturing the cell with the agent) or, alternatively, in vivo (e.g, by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a mGluRSM protein or nucleic acid molecule. In one embodiment, the 
25 method involves administering an agent (e.g., an agent identified by a screening assay 
described herein), or combination of agents that modulates (e.g., upregulates or 
downregulates) mGluRSM expression or activity. In another embodiment, the method 
involves administering a mGluRSM protein or nucleic acid molecule as therapy to 
compensate for reduced or aberrant mGluRSM expression or activity. 
30 Stimulation of mGluRSM activity is desirable in situations in which mGluR5M 

is abnormally downregulated and/or in which increased mGluRSM activity is likely to 
have a beneficial effect (e.g., in situations where mGluR activity is abnormally 
upregulated or increased, for example, in a CNS or psychiatric disorder). Likewise, 
inhibition of mGluRSM activity is desirable in situations in which mGluR5M is 
35 abnormally upregulated and/or in which decreased mGluRSM activity is likely to have a 
beneficial effect. 
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3. Pharmacogenetics 

The mGluRSM molecules of the present invention, as well as agents, or 
modulators which have a stimulatory or inhibitory effect on mGluRSM activity {e.g., 
mGluR5M gene expression) as identified by a screening assay described herein can be 
5 administered to individuals to treat (prophylactically or therapeutically) disorders (e.g, in 
CNS or psychiatric disorders) associated with aberrant mGluR5M activity. In 
conjunction with such treatment, pharmacogenomics (/. e. , the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
or drug) may be considered. Differences in metabolism of therapeutics can lead to 

10 severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 
determining whether to administer a mGluR5M molecule or mGluRSM modulator as 
well as tailoring the dosage and/or therapeutic regimen of treatment with a mGluR5M 

1 5 molecule or mGluRSM modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 
persons. See e.g., Eichelbaum, ML, Clin Exp Pharmacol Physiol, 1996, 23(10-1 1) :983- 
985 and Linder, M.W., Clin Chem, 1997, 43(2):254-266. In general, two types of 

20 pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body (altered drug action) or genetic 
conditions transmitted as single factors altering the way the body acts on drugs (altered 
drug metabolism). These pharmacogenetic conditions can occur either as rare genetic 
defects or as naturally-occurring polymorphisms. For example, glucose-6-phosphate 

25 dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in which the 

main clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 
response, known as "a genome-wide association", relies primarily on a high-resolution 

30 map of the human genome consisting of already known gene-related markers (e.g., a "bi- 
allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
sites on the human genome, each of which has two variants.) Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
significant number of patients taking part in a Phase II/III drug trial to identify markers 

35 associated with a particular observed drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 
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DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 
be grouped into genetic categories depending on a particular pattern of SNPs in their 
5 individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 

10 encodes a drugs target is known (e.g., a mGluR5M protein or mGluRSM protein of the 
present invention), all common variants of that gene can be fairly easily identified in the 
population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

15 major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acety transferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 

20 These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 

25 experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated by its C YP2D6-formed 
metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 
do not respond to standard doses. Recently, the molecular basis of ultra-rapid 

30 metabolism has been identified to be due to CYP2D6 gene amplification. 

Alternatively, a method termed the "gene expression profiling", can be utilized to 
identify genes that predict drug response. For example, the gene expression of an 
animal dosed with a drug (e.g., a mGluR5M molecule or mGluR5M modulator of the 
present invention) can give an indication whether gene pathways related to toxicity have 

35 been turned on. 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
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dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with a mGluRSM 
molecule or mGluRSM modulator, such as a modulator identified by one of the 
exemplary screening assays described herein. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 
References throughout the instant specification to websites maintained as part of the 
World Wide Web are referred to herein by the prefix http://. The information contained 
in such websites is publically-avialable and can be accessed elctronically by contacting 
the cited address. 
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EXAMPLES 

Example 1; Identification and Characterization of a mGluR5M cDNA 

In this example, the identification and characterization of the genes encoding 
5 human mGluR5M (also referred to interchangeably herein as " YI1 76") is described. 

Isolation of the human mGluR5M cDNAs 

A full-length clone designated YI176 was identified in a cDNA library derived 

from adult human brain mRNA (Clonetech™). (A partial YI176 cDNA was also 
10 isolated from a Clonetech™ hippocampus cDNA library.) The human clone contained 

an insert of approximately 1 823 bp containing a protein-encoding sequence (/". e. , an open 

reading frame) of approximately 1110 nucleotides capable of encoding approximately 

369 amino acids of mGluR5M. 

The nucleotide sequence encoding the human mGluRSM protein is shown in 
15 Figure 1 and is set forth as SEQ ID NO:l . The full length protein encoded by this 

nucleic acid includes about 369 amino acids and has the amino acid sequence shown in 

Figure 1 and set forth as SEQ ID NO:2. The coding portion (open reading frame) of 

SEQ ID NO:l is set forth as SEQ ID NO:3. 

20 Analysis of Human mGluRSM 

A BLAST search (Altschul et al (1990) J. Mol. Biol. 215:403) of the amino acid 
sequence of human mGluRSM revealed that mGluRSM is significantly similar to the N- 
terminus of the human metabotropic glutamate receptor 5 (mGluR5). In particular, the 
mGluRSM protein identified has a N-terminus which is nearly identical to the N- 

25 terminal extracellular binding domain of the mGluRS protein (e.g., SwissProt™ 

Accession No. P41594). Amino acids 1-303 of mGluR5M are -97% identical to the N- 
terminal extracellular domain of mGluRSa and/or mGluRSb. By contrast, amino acids 
304-369 are unique as determined by BLAST analysis. 

A more recent BLAST search identified a cDNA termed RNF18 (a testis-specific 

30 ring finger protein having GenBank Accession No. AB037682 (Yoshikawa et al (2000) 
BBA 1493:349-355)) as sharing identity with YI176 at the 3' end. Moreover, ESTs have 
been reported which contain sequence that spans the YI176/mGluR5 region 
(GBESTHUM3_REL:BE674422and GBESTHUM3_REL:BE467477), evidencing that 
YI176 represents a true mRNA (in contrast to a library artifact, for example, an mGluRS 

35 artifact). 

A Prosite search resulted in the identification of a G protein receptor family 31 
consensus sequence at about amino acids 158-176 of SEQ ID NO:2. mGluR5M is also 
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I'll 



predicted to contain Asn glycosylation sites at about amino acids 88-91 and 210-213 of 
SEQ ID NO:2. 



5 Tissue Distribution of mGluR5M mRNA 

This Example describes the tissue distribution of mGluR5M mRNA, as 
determined by Northern blot analysis. 

Northern blot hybridizations with various RNA samples were performed as 
follows. Clonetech Human Multiple Tissue Northerns™ and Human Multiple Tissue 
10 Array™ membranes were probed using the manufacturer's protocols and 32 P-labeled 
DNA probes. Probes containing YI176-specific sequence (nucleotides 959-1 103 and 
{** nucleotides 1481-1 846 of SEQ ID NO: 1 ) were generated as follows. Plasmid DNA 

from an isolated YI176 colony was prepared using the QIAprep Spin Miniprep™ Kit 
and manufacturer's protocol. Subsequently, the DNA was restriction digested with PstI 
1 5 and NotI or Bglll and Apal according to the manufacturer's instructions. Restriction 
fragments were size-fractionated by gel electrophoresis on 1.5% agarose, 0.1 |ug/ml 
Uj ethidium bromide, 1 x gels (Maniatis et al. 7 1982). The ethidium-bromide stained DNA 

l mim band of the appropraite size (Pstl/NotI -365 bp; Bglll/Apal -144 bp) was excised from 

;f|J the agarose gel. Next, the DNA was extracted from the agarose using the Clonetech 

r y 20 NucleoSpin™ Nucleic Acid Purification Kit and manufacturer's protocol. The extracted 
hj DNA was labeled with Redivue™ (a 32 P)dCTP using the Prime-It II™ Random Primer 

H» Labeling Kit and protocol. Unincorporated (a 32 P)dCTP was removed with Amersham's 

NICK™ column and protocol. Membranes were prehybridized to block non-specific 
binding interactions and subsequently hybridized with an appropriate aliquot of the 32 P- 
25 labeled YI176 probe under standard conditions. After hybridization and washing, 
membranes were air dried, exposed to X-ray film and subsequently developed and 
analyzed. 

No bands were detected upon analysis of the Multiple Tissue Northerns™. 
However, the Multiple Tissue Array™ membranes indicated that a YI176 is expressed 

30 in neural but not heart or other non-neural tissues. These data in addition to those 

obtained from further Northern analysis indicate that mGluR5M mRNAis detectable in 
whole brain, cerebral cortex, frontal lobe, parietal lobe, occipital lobe, temporal lobe, 
paracentral gyrus of cerebral cortex, pons, cerebellum, corpus callosum, amygdala, 
caudate nucleus, hippocampus, medulla oblongta, putamen, substantia nigra, accumbens 

35 nucleus, thalamus and fetal brain. Notably, expression of mGluR5M is prdominant in 
cells and/or tissues of the central nervous system. 
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Example 2: Identification and Characterization of a mGIuRSM cDNA 

In this example, the chromosomal mapping of the gene encoding human 
mGIuRSM (also referred to interchangeably herein as "YI176 H ) is described. 

5 The chromosomal location of YI176 and mGluRS was determined by BLAST™ 

searching of the HTGS databases using the appropriate relative query sequences. These 
analyses indicated that mGluR5 maps to BACs assigned to chromosome 1 1 and that 
YI176 maps to BACs assigned to chromosome 1 1 and chromosome 3. Of particular 
interest is the fact that at least some of the YI176-encoding BACs map to a region of 

10 chromosome 1 1 that has recently been identified as one of the regions affected in a 
balanced translocation event associated with schizophrenia and related psychiatric 
disorders (Millar et al (2000) Hum. Mol Genet 9:1415-1423). In particular, Millar and 
co-workers described a balanced (1;1 1)(q42.1;ql4.3) translocation which segregates 
with schizophrenia and related psychiatric disorders in a large Scottish family studied. 

1 5 Millar and co-workers analyzed the affected region on chromosome 1 and, in particular, 
described two novel genes, Disrupted-In-Schizophrenia 1 and 2 (DISCI and DISC2\ 
believed to play a role in nervous system function and/or susceptibility to psychiatric 
illness. Millar and co-workers described the affected chromosome 1 1 region as being 
characterized by a dearth of genes in the breakpoint region, concluding that it is unlikely 

20 that the expression of any genes on this chromosome are affected by the translocation. 
The above-described chromosimal mapping data (e.g.^ radiation hybrid mapping), the 
specific expression patterns of YI176 mRNA, as well as genomic predictions indicate 
the gene as a candidate for schizophrenia and/or other psychiatric disorders. 

25 Example 3: Radiation Hybrid Mapping of YI176 

Radiation Hybrid mapping was performed using the GeneBridge 4 Radiation 
Hybrid Panel (catalog # RH02.02 Research Genetics, Inc) and manufacturer's protocol. 
Briefly, polymerase chain reaction (PCR) amplification with the primers 
5TGCTGCTGCACATGCCCC and 5TTAGATGAGCCTGTCCCTCAGTCC was 

30 performed using template DNA from each of the somatic hybrid clones. A reaction 

mixture was compiled with components at the following final concentrations: 50ng of 
DNA, 8 pmol of each; 0.2mM each dATP, dTTP, dCTP, and dGTP (Amersham 
Pharmacia); one unit AmpliTaqGold™ polymerase; lx reaction buffer (Applied 
Biosystems); 2.5 mM MgC12. The mixture was incubated at 95°C for ten minutes, 

35 followed by 30 cycles of 94°C 30 seconds, 65°C 30 seconds, 72°C 23 seconds 

(MJResearch DNA Engine Tetrad PTC-225). PCR amplification products were size- 
fractionated on 3% agarose, lx TAE gels (Maniatis et al). Each somatic hybrid clone 
was scored for the presence or absence of the PCR product. The results were submitted 
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to http://www-genome.wi.mit.edxi/cgi-bin/contig/rhmapper.pl or 

http://www.sanger.ac.uk/Software/^ for RH mapping. The MIT 

mapping server assigned the gene to chromosome 1 1, 31 .33 cR from the marker 
Dl 1S1350. The Sanger mapping server assigned the gene to chromsome 1 1 between the 
5 markers AFMal 3 1 xd5 and AFM344zg 1 . 
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Example 4: Expression of Recombinant YI176 Protein in HEK293 Cells 

YI176 cDNA was stably transfected into HEK293 cells using the Invitrogen™ 
10 FLP-IN system according to manufacturer's instructions. YI176 protein (epitope tagged 
for detection) was detectable in the medium confirming that the protein is secreted, as 
was predicted from sequence analysis. 

Example 5: Heterodimer Formation of Recombinant YI176 Protein with mGluR5 

1 5 Transient co-transfections into HEK293 cells were performed using the 

Lipofectamine Plus™ system (InVitrogen) according to manufacturer's instructions. 
Sample 1 : Cells co-transfected with a cDNA encoding a V5 -tagged secreted form of 
mGluR5 (comprising mGluR5 residues 1-576 which includes the extracellular domain 
but lacks the transmembrane domains), referred to herein as rtV5, and full-length (FL) 

20 mGluR5. Sample 2: Cells transfected with rtV5 alone. Sample 3: Cells transfected with 
YI176-V5 (tagged YI176) alone. Sample 4: Cells co-transfected with YI176-V5 and FL 
mGluR5. Sample 5: Cells transfected with FL mGluR5 alone. Sample 6: Mock- 
transfected cells. 

Cell membrane fractions were prepared from the transfected cells in accordance 
25 will well-established protocols, i.e., cell lysis followed by washing and centrifugation 
steps to separate cytosolic, nuclear, and membrane fractions. Proteins were 
immunoprecipitated from cell membrane fractions with an anti-mGluR5 antibody, then 
separated by SDS polyacrylamide gel electrophoresis. Western blot analysis was 
performed using an anti-V5 tag-specific antibody. The results are set forth in Table 1 . 

Table 1: IP with anti-mGluR5 antibody; Western blot with anti-V5 antibody 

rtV5+ YI176-V5 + 

FLmGluR5 rtV5 YI176-V5 FL mGluRS FL mGluRS MOCK 

V5 tagged 

mGluR5 +++++ 

YI176-V5 ++++ 

V5-tagged (secreted) mGluR5:FL mGluRS complexes were detected in Sample 
35 1, indicating dimerization of the tagged (secreted) mGluR5 with FL mGluR5 receptor 
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expressed in cell membranes. Notably, YI176:FL mGluRS complexes were similarly 
detected in Sample 4, demonstrating that YI176 is able to dimerize with FL mGluR5 
expressed in cell membranes. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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